PREFACE


The first two editions of Fundamentals of Learning and Memory were well
received. Each provided up-to-date information about learning and memory, using
both human and animal data as a basis for analysis. This third edition promises
to be even more effective in clearly and concisely examining the pertinent
research and issues in the field. I feel confident in making this claim for
several reasons, all of them having to do with the improved quality of this
edition.

1. Organization. The basic organization of the text around the critical
concepts of acquisition, transfer, and retention has not changed. However, the
grouping of information into specific chapters has been altered somewhat. The
chapters on contiguity and practice (Chapters 5 and 6 in the second edition)
have been combined into one chapter (Chapter 5). Similarly, the chapters on
generalization and transfer (Chapters 8 and 9 in the last edition) have now
become Chapter 7. While I continue to believe the issues covered in these
chapters are crucial, the reorganization streamlines the text and allows the
inclusion of exciting new developments in the field. For similar reasons, I
have eliminated chapters on the physical basis of learning (Chapter 16) and
behavior modification (Chapter 17). I have not, however, omitted the
information. It has been blended into the text at appropriate points, thereby
accentuating its relevance and making the concepts under consideration easier
to grasp.

2. Length. Many of the structural changes in this edition have been undertaken
in an attempt to shorten the text. The second edition was essentially an
expansion of the first: little was deleted and a good deal was added. Several
reviewers suggested that I trim the third edition a bit, and I have taken their
advice. Instead of simply adding new material, I have tried to implement a
judicious selection process, eliminating the less important and incorporating
the more significant. The result is, I hope, a superior edition that is both
succinct and comprehensive.

3. References. As was true of the earlier editions, the third edition of
Fundamentals of Learning and Memory has been completely updated. Older, less
crucial sources have been deleted and new references have been added. Among the
new references are several hundred citations that postdate the second edition.
These changes give the book a fresh feel and ensure that it will continue to be
one of the most up-to-date texts in the field of learning and memory.

4. Emphasis on Human and Animal Learning. Once again this text has been written
for those who believe a course in learning should attempt to identify the
principles of learning and memory that are universal, regardless of the species
under study. Thus data from both human and animal research have been included.

5. Writing Style. In each of the editions of Fundamentals, I have attempted to
create a text that is not only stimulating and engaging but also easy to read
and comprehend. Given the complexity of the field, this has been a sizable
undertaking. However, if student feedback is any indication, I have been
successful. Part of this success stems from my effort to deal with issues
rather than simply listing an endless series of empirical facts. While this
edition, like the earlier ones, is geared to upper-division students,
lower-division and graduate students should also find it helpful.

Many people have provided me with invaluable assistance in completing this
edition. In particular, I would like to thank the following reviewers for their
suggestions and guidance: Delos Wickens, Colorado State University; Vincent
Schulte, Northern Kentucky University; Edward Gavurin, Lehman College, CUNY;
and Harry Steger, Boise State University.

I would also like to thank the people at Academic Press. Special thanks go to
Warren Abraham, my editor, and Lenn Holland, this edition’s production
supervisor. Without their careful attention, the writing and production process
would have been much more difficult.
PART ONE

INTRODUCTION


This section introduces you to the language of the field of learning and to
many of the basic conditions and variables that psychologists in the field
consider. In addition, it examines differences among the many available ways of
learning, or learning tasks. Finally, it discusses the constraints that natural
selection places on learning processes.

Chapter 1 brings the field of learning into focus in two ways. First, we
consider a definition of learning. Although such a definition is helpful, it is
probably impossible to encompass all that is meant by learning in a single
sentence. Accordingly, our second attempt to circumscribe the field of learning
involves briefly looking at a number of the different learning tasks employed
and investigated by psychologists. These tasks range from basic conditioning
paradigms to more complex verbal processes and include problem solving, concept
formation, and language.

Chapter 2 examines classical and instrumental conditioning in detail. These two
types of learning task have been extremely important and influential in all
aspects of the field of learning. The wide variety of learning tasks outlined
in Chapter 1 and the selected tasks dealt with in greater detail in Chapter 2
prepare us to address an important question in Chapter 3: Do these various
tasks represent fundamentally different learning processes, or are they merely
different expressions of some common, underlying learning process? On the one
hand, if these tasks do represent fundamentally different types of learning, we
will be forced to develop theories and models that encompass each of the
distinct learning processes. On the other hand, if some common process accounts
for all of these tasks, a single model should suffice. Chapter 3 examines
representative learning tasks in relation to one another. Both the similarities
and differences among them are considered in detail. The conclusion is that we
are not yet ready to affirm with any certainty that these tasks represent
fundamentally different learning processes. The similarities among the tasks
are strong enough to suggest that some unitary process may account for them
all.

Recent research has suggested that not all species are ready to learn all
responses equally easily. Certain species seem prepared to learn certain
responses but not others. In Chapter 4 we examine the biological constraints on
learning and deal with the fact that each animal brings to every learning
situation certain capabilities and predispositions to act and learn in certain
ways. These predispositions may be formed by the pressures of natural
selection.
A DEFINITION OF LEARNING

What do we mean by the word learning? It is a word we use all the time, and as a
student you must have some special involvement with or concern for the learning 
process. But how would you define it? Stop for a moment and try to frame a 
definition. 

As you probably found, this is not an easy task. Psychologists have the same 
problem. They, too, are not sure how to define it. Learning is one of those words 
everyone uses and seems to understand but would be hard pressed to define. 
As a result there are many different definitions current in the field of psychology.
Some are very specific and concrete; others are diffuse and general. Here is one 
good one. 

Gregory Kimble (1961, 1967) has developed a definition of learning that seems 
to encompass many of the elements of other definitions. Essentially the same 
definition has been proposed by others (for instance, Logan, 1970). Hence, it 
makes a good example. According to Kimble’s definition, learning is a relatively 
permanent change in behavior potentiality that occurs as a result of reinforced 
practice. Let us look at some of the terms in this definition. Why does Kimble 
include the term permanent in his definition? What kinds of behavior change
would he like to exclude by using permanent? Among others, he might want to
exclude temporary changes in behavior resulting from shifts in motivation. When 
we are tired, for example, we sleep. The shift from the waking state to the sleeping 
state represents an enormous change in our behavior. But it does not represent 
new learning. It can be argued that changes in behavior resulting from fatigue 
should not be considered learned changes. Many other motivational factors, such 
as hunger, thirst, and the need for sex, can affect, or change, our behavior without 
involving new learning. If we are hungry, for example, and a steak, a knife, and a 
fork are placed in front of us, we will probably eat the steak using the utensils. 
Our behavior changes. But does the change from not eating to eating represent 
new learning? Probably not. We utilize old, established habits (that is, the
fundamentals of using a knife and fork), but we are not learning anything new. When
we have finished eating and our hunger is reduced, we will revert to our former 
mode of behavior (not eating). Hence, temporary changes in behavior, resulting 
from fluctuations in motivational states, tend to be excluded from the category of 
learned changes. 

What about the term practice? Why does Kimble include it in his definition? He
seems to be arguing that only changes that occur as a result of practice are learned 
changes and that unpracticed changes can occur, which we would not want to call 
learned changes. Among these are, for example, the effects of aging or
maturation. An enormous range of unlearned changes can be attributed to aging. As we
grow older, for instance, our short-term memory may deteriorate. That is, we 
may have increasing difficulty remembering new material for short periods of 
time. That is a big change. But such a change does not represent learning. We did 
not learn to be more forgetful. In addition, various physiological factors can 
produce unpracticed changes in behavior that are not considered learned changes. 
For example, disease and accidents can yield strong, permanent, unlearned 
changes in our behavior. 

Why does Kimble consider learning to involve changes in “behavior potentiality”
rather than in behavior? Why complicate the situation? An example may
clear up the issue. At this moment, residing within you, somewhere, somehow, is
the knowledge, or the learning, necessary to brush your teeth. But you are not 
brushing your teeth at this moment. We can say the capacity to brush is latent. It 
is in there somewhere, but it is not being expressed as overt behavior at this 
moment. We have acquired the capacity to brush, but that capacity is not always 
apparent, or expressed overtly. 

Psychologists distinguish between learning and performance. Performance
refers to the translation of learning into behavior, usually through the involvement
of motivation. For example, if your mouth felt very unpleasant, you would 
probably drop what you were doing and brush your teeth. You would be
motivated to brush, and it would be the thrust of this motivation that would impel you
to brush, or to translate the brushing potentiality into observable, overt behavior. 
Learning is “invisible.” It lies within us as a potentiality and can only be observed 
when it interacts with motivation and is translated into, or expressed as, overt 
behavior. 

The distinction between learning and performance is crucial, and we will return 
to it many times in future sections. Performance cannot always be taken as a 
measure of how much learning has occurred. For example, if one child spells 
every word in a spelling test correctly and another child spells none correctly, we 
cannot conclude that the first child has learned more than the second. The second 
child may know all the words, but for any number of reasons she may refuse to 
reveal her learning. When we try to infer how much learning has occurred by 
looking at performance, we must be aware of the fact that motivation as well as 
learning determines performance. 

Finally, consider the term reinforced practice, which Kimble includes in his
definition. The concept of reinforcement is extremely complicated. For the
moment, think of reinforcement as reward. According to Kimble’s definition, an
animal will not learn to do something unless the action leads to, or is followed by, 
a reward. Children will not learn to pick up their toys unless the behavior is 
followed by a reward. The reward may come in the form of parental praise and 
approval, or it may be something the children like or enjoy, such as candy or a trip 
to the zoo. The avoidance of punishment may also be rewarding. Children may 
even be able to reward themselves by congratulating themselves. 

The question whether reinforcement is necessary for learning to occur is very 
controversial and has been so for many years. Many psychologists feel that 
reinforcement is essential, whereas others maintain with equal conviction that it is 
not. Whether rewards will facilitate performance does not seem to be the
question. If a child tells a joke and adults laugh, they may end up hearing that same
joke to the point of tedium. Few psychologists would argue that reinforcements 
do not help to strengthen behaviors in one way or another. But some of them 
would argue that reinforcement is not absolutely essential for learning to occur. 
They feel that learning can occur in the absence of reinforcement. For example, as 
we drive down the highway, caught up in our latest set of concerns, we pay very 
little attention to the passing scenery. Yet at the end of our journey, if someone 
asks what we saw along the road, we may well be able to remember many events 
and objects. Where or what was the reinforcement for this incidental learning? 

Consider the issue a little more closely. It seems that the problem is to come up 
with a situation in which everyone would agree that learning occurs but that
involves no reward. Does our ability to remember the name of a stranger we met 
only briefly at a crowded party constitute learning without reward? Some argue 
that we have rewarded ourselves in this case. We might pride ourselves on being 
able to remember names. Remember that rewards do not have to be food pellets 
or trips to Bermuda. They can be very subtle and covert. What about the 
incidental highway learning just described? Is that learning without
reinforcement? Some have contended that the very acts of perceiving and processing
information are reinforcing. Many studies seem to suggest that we will learn 
responses that are rewarded by the opportunity to process information. For
example, monkeys will learn to open little windows for nothing more than the 
opportunity for visual exploration (Butler, 1953). Thus go argument and
counterargument, and we begin to see the dilemma. Each time a new, reward-free
learning situation is proposed, the reinforcement theorist is able to find some 
possible reinforcing event in it. For example, Harlow, Harlow, and Meyer (1950) 
have proposed that the mere manipulation of objects is rewarding, and Hill 
(1965) has argued that any kind of activity can be rewarding. The final step in the 
argument seems to come when it is proposed that the act of learning rewards 
itself. If we accept this, then it becomes impossible to conceive of a situation that 
involves learning without reward. Hence, the issue, as interesting as it is, must be 
laid aside. We shall return to the concept of reinforcement many times, for even 
though it may not be fully understood, it is one of the most powerful variables in 
any learning situation. 

So much for Kimble’s definition of learning. Psychologists have many different 
ideas about what would be an adequate definition. Kimble’s definition merely 
represents one viewpoint. For example, Thorpe (1963) defines learning as “that 
process which manifests itself by adaptive changes in individual behavior as a 
result of experience,” whereas Wingfield (1979) defines it as “a relatively
permanent change in behavior or knowledge brought about by practice or experience.”
As you can see, each of these alternative definitions has something in
common with Kimble’s definition, and yet each has its own emphases and flavor as
well. For our purposes Kimble’s definition is the most useful and complete one 
available. 


EXAMPLES OF LEARNING 

Another way to bring the field of learning into focus is to describe some of the 
learning tasks psychologists have used and studied in the laboratory. Obviously, 
there are many different learning situations. We can learn to throw a ball, we can 
learn to stay out of trouble, we can learn to be a lawyer at night school, and we 
can learn the birds native to the western United States. These different learning 
situations may or may not represent fundamentally different types of learning. 
They may or may not require different theories to account for them. No one is 
quite sure how many truly different ways we can learn things. As you read about 
the different learning tasks described in what follows, keep in mind that, although 
they may seem quite distinct, they do not necessarily represent different
underlying learning processes.

Because we sometimes use the term learning and sometimes conditioning, it is
necessary to distinguish between these two labels. On the one hand, conditioning
generally refers to relatively simple learning situations, such as classical
conditioning and instrumental conditioning, which are discussed below. Learning, on
the other hand, is a much broader term. It usually refers to conditioning and also 
to more complex situations. In other words, all conditioning situations are simple 
forms of learning, but some learning tasks involve more than conditioning. 

Classical Conditioning 

Most people have heard of Pavlov’s experiments in which dogs were conditioned 
to salivate at the sound of a tone. These experiments represent something called 
classical conditioning. Although trained as a physiologist, Pavlov has had a
tremendous impact on the field of psychology with his classical conditioning 
studies. The experimental paradigm was as follows. Before the conditioning 
experiment (Anrep, 1920) the dog’s salivary duct was surgically treated so that 
saliva flowed through a small opening to outside the dog’s cheek, where it could 
be collected and measured accurately. The dog was then trained to stand in a 
harness, as depicted in Figure 1.1a. A tuning fork was sounded. Several seconds 
later a small amount of dry meat powder was moved close to the dog’s mouth, 
and the dog was allowed to eat it. On the first trial (or pairing of the sound with 
the meat powder) the sound stimulated no salivation, but the food did. The 
procedure was to continue pairing the sound and the meat powder over a series of 
trials. After a number of pairings the tuning fork was sounded alone. No meat 
powder was presented afterward. It was found that the sound alone elicited a 
salivary response. The salivary response had become conditioned to the sound of 
the tuning fork. 

As we shall see, many different responses can be classically conditioned (see 
Figure 1.1b). But the Pavlovian situation allows us to identify the basic elements 
of all classical conditioning situations. (Incidentally, classical conditioning is 
sometimes referred to as Pavlovian conditioning and sometimes as respondent 
conditioning; these labels can be used interchangeably.)

THE UNCONDITIONED STIMULUS. The unconditioned stimulus (UCS) in the 
Pavlovian situation is the meat powder. It is a stimulus that, before the
conditioning experiment, consistently and regularly elicits a response. Every time we
present the meat powder, the dog salivates. The capacity of the UCS to elicit a 
regular response can be innate or learned. It does not matter, so long as it elicits a 
response consistently before the conditioning experiment begins. 

THE UNCONDITIONED RESPONSE. The unconditioned response (UCR) is the
consistent response to the UCS. In the Pavlovian situation the salivation in response
to the meat powder is the UCR. 

THE CONDITIONED STIMULUS. The conditioned stimulus (CS) is the sound of the 
tuning fork. The CS is a stimulus that does not initially elicit salivation. We pair 
the CS (sound) with the UCS (meat powder) over and over. With repeated 
pairings the CS will begin to elicit a salivary response when it is presented by 
itself. 

THE CONDITIONED RESPONSE. The conditioned response (CR) is the learned
response elicited by the CS. That is, if we sound the tuning fork alone, and the dog
salivates, then that salivary response is called the CR. If the dog salivates to the
UCS alone, that response is a UCR, not a CR. Students often ask if the CR and the
UCR are the same thing. Saliva is saliva. A good point; but, strange as it may
sound, the CR and the UCR may well differ in important ways. This problem will
be considered later in more detail. Figure 1.2 shows the steps involved in classical
conditioning.

Figure 1.1 Two classical conditioning situations. (a) Pavlov’s salivary conditioning
apparatus. (b) A modern conditioning situation. After repeated pairings of a tone and
light shock to the forepaw, the tone alone will begin to elicit forepaw withdrawal.

[(a) Adapted from Yerkes, R. M., and Morgulis, S. The method of Pavlov in animal
psychology. Psychological Bulletin, 1909, 6, 257-273. Fig. 2, p. 264. Copyright 1909 by
the American Psychological Association. (b) From Wickens, C., Tuber, D. S., and
Wickens, D. D. Memory for the conditioned response: The proactive effect of
preexposure to potential conditioning stimuli and context change. Journal of
Experimental Psychology: General, 1983, 112, 41-57. Fig. 1, p. 43. Copyright 1983 by
the American Psychological Association. Reprinted by permission. (Photo courtesy of
Delos Wickens.)]

Figure 1.2 The stages of classical conditioning. A UCS will elicit a UCR before
conditioning, whereas a CS will not elicit a response. During conditioning the CS and
UCS are presented together, and the CS acquires the capacity to elicit a CR that is similar
to the UCR.
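
The Pavlovian procedure and its four elements can be sketched as a toy simulation. This is only an illustration, not a model proposed in the text: the class name Dog, the strength variable, the growth rule, and the learning_rate and threshold values are all assumptions chosen so that the CS elicits nothing before the pairings and a CR afterward.

```python
from dataclasses import dataclass


@dataclass
class Dog:
    """Toy subject. `strength` is a hypothetical CS association in [0, 1]."""
    strength: float = 0.0
    learning_rate: float = 0.3  # assumed value, not taken from the text
    threshold: float = 0.5      # assumed level at which the CS alone elicits salivation

    def trial(self, cs: bool, ucs: bool) -> str:
        """Run one trial and return a label for the salivary response, if any."""
        if cs and ucs:
            # Pairing trial: the association moves part of the way toward 1.0.
            self.strength += self.learning_rate * (1.0 - self.strength)
        if ucs:
            return "UCR"   # the meat powder consistently elicits salivation
        if cs and self.strength >= self.threshold:
            return "CR"    # the tuning fork alone now elicits salivation
        return "none"


dog = Dog()
before = dog.trial(cs=True, ucs=False)                        # tone alone, before conditioning
pairings = [dog.trial(cs=True, ucs=True) for _ in range(5)]   # tone followed by meat powder
after = dog.trial(cs=True, ucs=False)                         # tone alone, after conditioning
print(before, pairings[0], after)
```

The sketch deliberately leaves out anything the passage does not describe, such as what happens when the CS is presented alone many times; the test trial here does not change the association.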

CLASSICAL CONDITIONING IN EVERYDAY LIFE. Classical conditioning occurs
every day. You may have had an experience such as this: You meet someone you
think is very attractive. You approach, saying, “Why don’t you stay awhile? 
Watch TV or something?” Very gentle. Very casual. The person gives you a flat, 
irritable look and says, “What in the world for?” You are humiliated, angry, and 
hurt, but you manage to live through it. A couple of days later one of your friends 
happens to mention the person in passing. You immediately feel a warm flush 
crawling up the back of your neck. That cringing is probably a CR. Many of our 
emotional reactions are probably classically conditioned. If we consistently
experience strong emotions, either positive or negative, in a given situation, then
anything that reminds us of the situation in the future may well trigger a
conditioned emotional reaction. We cannot avoid it. It seems to be automatic. 

Suppose your instructor should march into class one day, raise a pistol loaded 
with blanks above his head, shout “Boo!” and fire. The next day all your 
instructor need do is walk in, raise the pistol, and yell “Boo!” without firing. 
Almost everyone will experience a good CR. 

Instrumental Conditioning 

Imagine this situation: You are in a large, white, empty room. You do not know
how you got there, but you do know you are very hungry. Extremely hungry. 
There is a metal bar protruding from one of the walls of the room, and there is a 
little hole in the wall near the bar. There is a ventilation grate in the ceiling but no 
windows and no doors. What would you do in this situation? You might explore, 
try to escape, sit down and cry, pound the walls, pace back and forth, or call for 
help. You might also investigate the metal bar and the small hole. Let’s say that at 
some point in your explorations you press the bar with your hand. As you do so, 
it makes a clicking sound and you jump back, startled. As the click sounds, a 
small cracker rolls out of the hole beneath the bar. You eat the cracker. As you are 
still hungry, you give it another try and eat the resulting food. You have
discovered that pressing the bar and the delivery of food are related. You settle down,
press the bar, and eat until your hunger is satisfied. 

The situation just described, in which the probability of a particular response 
(the bar press) has been increased through the immediate delivery of a reinforcer 
(the food), is instrumental conditioning. (It is also called operant conditioning.) 
The response was instrumental in obtaining something wanted and needed. 

B. F. Skinner, a former Harvard professor and author, has contributed heavily 
to the importance of instrumental conditioning in the field of learning. His work 
with rats and pigeons and his development of the famous Skinner box (see Figure 
1.3) form the basis for much of what we presently know about instrumental 
conditioning. In its simplest form a Skinner box contains a lever and a food tray. 
The apparatus is set up so that, each time the lever is pressed, a small piece of food 
will be delivered to the tray. A hungry rat placed in this box will explore, discover 
the bar, and eventually learn to press the bar regularly in order to obtain the food. 
Figure 1.4 shows a more elaborate Skinner box of the type used in
modern laboratory research. 

Although this learning situation appears very simple, many psychologists feel 
that we can learn a great deal about human behavior by observing rats and other 
animals in this sort of apparatus. Psychologists choose rats as subjects because, in 
addition to being small, inexpensive, curious, and ready to learn, they can be 
handled in ways that would outrage the average human undergraduate. 

Instrumental conditioning must be one of the most common forms of learning 
we encounter in our everyday life. Children learn to play games because those 
activities are rewarding. Our pets learn to respond to our calls because of 
rewards. Students learn course materials for grades, recognition, and
advancement. Adults learn the requirements of their jobs for pay, promotions, and the
pleasure of doing their work. In fact, people who have been in a coma for 6 
months can learn to move their head, eyes, or fingers when rewarded with their 
favorite music (Boyle & Greer, 1983). 
Figure 1.3 Typical Skinner box

There is no end to the instances of instrumental conditioning in our life, and yet 
there remains considerable confusion over the relationships among instrumental 
conditioning and the other forms of learning that we shall consider. In Chapter 3 
we shall discuss important differences and similarities among a number of
learning tasks.


Discrimination Learning 

Another learning task that psychologists utilize and study is called discrimination 
learning. This task can be illustrated by placing a hungry pigeon in a Skinner box 
with two illuminated disks—a blue one on the right and a yellow one on the 
left—attached to the wall of the box. It is characteristic of hungry pigeons to 
engage in a good deal of pecking behavior. They peck the walls, the floor, and the 
disks. The experimenter sets up the situation so that, each time the pigeon pecks 
the blue disk, it is rewarded with a kernel of corn. Each time it pecks the yellow 
disk, it receives no reward. The blue disk is called the positive stimulus , and the 
yellow, the negative stimulus. At first the pigeon pecks the two disks about equally. 
But as time goes on, it responds more and more often to the blue disk, the one that 
yields food. The pigeon discriminates between the colors. Or does it? Can we 
conclude on the basis of the situation described that the animal has learned a 
blue-yellow discrimination? No, we cannot, for it may have been learning a
position discrimination. That is, it may have been ignoring the colors and
attending to the positions of the positive and negative stimuli. If it pecks the one on the
right, it is rewarded. If it pecks the left disk, nothing happens. We cannot be sure 
whether the animal is discriminating color or position. How can we avoid this 
confusion and ensure that the pigeon is learning a color discrimination? One way 
is to switch the positions of the colors on successive trials. Put the blue one on the 
left part of the time, and put it on the right part of the time. Then the animal will 
be forced to utilize the color cues. 
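
The logic of switching positions can be checked with a short sketch. It is illustrative only: the function names, the trial count, the seed, and the two idealized strategies ("color" always pecks blue, "position" always pecks the right-hand disk) are assumptions rather than details from any experiment described here.

```python
import random


def make_trials(n_trials: int, seed: int = 0) -> list[str]:
    """Return the side ('left' or 'right') of the blue disk on each trial.

    Blue appears on each side for exactly half of the trials, in shuffled order.
    """
    if n_trials % 2:
        raise ValueError("n_trials must be even to balance the two sides")
    sides = ["left", "right"] * (n_trials // 2)
    random.Random(seed).shuffle(sides)
    return sides


def reward_rate(trials: list[str], strategy: str) -> float:
    """Fraction of trials rewarded; only a peck at the blue disk pays off."""
    rewarded = 0
    for blue_side in trials:
        # "color" follows the blue disk; anything else always pecks the right side.
        pecked_side = blue_side if strategy == "color" else "right"
        rewarded += pecked_side == blue_side
    return rewarded / len(trials)


trials = make_trials(20)
color_rate = reward_rate(trials, "color")
position_rate = reward_rate(trials, "position")
print(color_rate, position_rate)
```

With the positions balanced, a pigeon attending only to position is rewarded on half of its pecks, while one attending to color is rewarded every time, so a steady preference for blue can no longer be explained by position alone.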
Operant Conditioning Demonstration

Gerbrands Operant Conditioning Demonstration Units and accessories offer the
professional and student researcher a reliable, convenient, and economical means of
demonstrating and studying the effects of the schedules of reinforcement. Feeders, cages, and
other components are the same Gerbrands high quality instruments furnished
separately to research laboratories throughout the world. These quality features will
insure years of reliable, trouble-free service:

1. Solid state control unit with combination handle/stand. Functional, easy-to-read
front panel assures ease of operation. Students can operate unit with a minimum
of instruction. D.C. output: 28 volts at 3 amps. Output connectors for recording or
counting responses, reinforcements, and other events.

2. Experimenter’s remote hand-switch provides complete control of all basic schedules
of reinforcement.

3. Input connector permits automatic programming of all basic schedules of reinforcement.

4. Low current, fast acting pellet dispenser. Dipper feeder and pigeon feeder models
also available.

5. External pellet tray with removable clear plexiglas cover. Animal eating and nosing
behavior visible to entire class. Does not get fouled with animal feces.

6. Quality stainless steel rat lever mounted in a rigid casting. Maintains pellet force
adjustment.

Five different models available.

Write for complete details.

GERBRANDS CORPORATION
8 Beck Road, Arlington, MA 02174 (617) 648-6415
Quality instrumentation since 1930.

Figure 1.4 Skinner boxes become big business. (Used by permission of Gerbrands
Corporation, Arlington, MA)
Figure 1.5 The Lashley jumping stand. If the animal leaps against the positive stimulus,
the card falls over easily, enabling the animal to reach the safety of the table. If the 
animal jumps against the negative card, it bumps its nose and falls to the net below. 

Figure 1.5 depicts a Lashley jumping stand, which is another piece of apparatus 
used in discrimination learning. The rat must learn how to gain access to the food 
table behind the stimuli and how to avoid falling into the net. If it leaps against 
the incorrect, or negative, stimulus, it bumps its nose and falls. If it jumps against 
the correct, or positive, stimulus, the card tips over easily, allowing the animal to 
reach the table. 

It is reasonable to ask if discrimination learning is not just another kind of 
instrumental conditioning. Often it is, but it is a special kind of conditioning in 
which the emphasis is on the formation of a discrimination between two extremely
salient cues. All conditioning involves discrimination learning. For example, a
rat in a Skinner box must discriminate the bar from the rest of the box. It must 
learn that pressing the wall or its own foot will not do any good. It is just that in 
discrimination learning the focus of the study is on the discrimination process. 
Although the different learning tasks, or situations, that psychologists have 
developed through the years look different on the surface, there is no guarantee 
that they really represent fundamentally different learning processes. There are 
many similarities among the various learning tasks, and it is often difficult to 
separate them beyond distinctions that are made on a superficial, descriptive level. 

The discrimination learning paradigm has definite counterparts in the real 
world. Discrimination learning is an essential part of everyday life. Imagine for a 
moment that you have lost the ability to discriminate. What would life be like if 
you could not discriminate? Most likely, it would be short. If you failed to
discriminate red circles from green circles, you would find it difficult to drive an
automobile in the city. If you failed to discriminate female bodies from male
bodies, all sorts of peculiar things might happen to you. If you suddenly lost the
ability to discriminate, you would respond the same way to widely varying and
sometimes inappropriate stimuli. We all learn to restrict our responses to stimuli
that will yield rewards and to inhibit our responses to stimuli that yield no reward
or, worse yet, yield punishment.

Figure 1.6 Microcomputers are used to present verbal materials to subjects in learning
experiments. (Courtesy of International Business Machines Corporation.)

Finally, it should be pointed out that discrimination learning is involved in 
classical as well as instrumental situations. For example, if a dog is given two 
potential CSs (two tones) but only one of them is followed by a UCS, the dog 
must discriminate between the two before the appropriate CS—CR connection 
will be established. 

Serial Learning 

In a typical human serial learning situation the subject is brought into the 
laboratory and is seated in front of a microcomputer monitor (see Figure 1.6). 
Table 1.1 Serial List Composed of Highly Emotional Words

JOY
DEATH
ACID
SCREAM
FEEL
KISS
HATE
RAPE
LOVE
GRASS
SEX
KILL

Before the experiment begins, the subject is told that a word or some other type of 
verbal material will appear on the screen. A few seconds later this word will 
disappear, and a second word will take its place. This pattern will be repeated 
until the subject has seen the entire list of, say, 15 words. The subject’s task is to 
remember these words and to remember the order in which they occur. After the 
subject has seen the list once all the way through, he or she is signaled that the list 
has ended and that the computer is about to present the words again in exactly 
the same order. The subject’s task is to anticipate which word will next appear on 
the screen. Each time a word appears, the subject tries to recall and say aloud the 
word that will next appear. This is called serial anticipation learning. 
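
The scoring of a single anticipation trial can be sketched in a few lines of Python. This is only an illustration of the procedure just described, not a description of any particular laboratory's software. The function name, the five-word list (taken from Table 1.1), and the subject's responses are all hypothetical.

```python
def score_anticipation_trial(serial_list, anticipations):
    """Count correct anticipations on one pass through a serial list.

    anticipations[i] is what the subject said while item i was on the
    screen, that is, a guess at item i + 1. Nothing follows the last
    item, so only len(serial_list) - 1 anticipations are scored.
    """
    correct = 0
    for position in range(len(serial_list) - 1):
        if anticipations[position] == serial_list[position + 1]:
            correct += 1
    return correct


# Hypothetical data: five words from Table 1.1 and one made-up set of
# responses from a subject who is partway through learning the list.
words = ["JOY", "DEATH", "ACID", "SCREAM", "FEEL"]
said = ["DEATH", "SCREAM", "SCREAM", "FEEL"]
print(score_anticipation_trial(words, said))
```

Repeating this count over successive trials would give a simple learning curve for the list.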

Serial, or sequential, learning, in which items are learned in a specific order, 
seems to be an integral part of our everyday life. Some examples include the 
memorization of phone numbers, the alphabet, poetry, and historical information.
Language behavior, in general, possesses a strong sequential component.
The hope is that our simple serial task will help us understand these more 
complicated serial behaviors. The serial learning task allows us to control, in a 
strict sense, any number of variables that we suspect might be important in 
determining serial learning. For example, Table 1.1 contains a serial list composed 
of highly emotional words. Some people will find this list difficult to learn; others 
will not. 

It should be pointed out that serial, or sequential, learning is not limited to 
verbal tasks or to humans. For instance, animals such as rats are engaging in a 
form of serial learning when they learn to run through a maze. Their everyday life 
is also probably full of serial learning tasks (for example, finding their way along 
rafters and through walls). One popular way to study sequential animal learning 
is to use the serial-pattern design (Self & Gaffan, 1983). Rats run down an alley 
for food pellets. On successive runs the number of pellets might be 14, 7, 3, 1, 0. 
Through repeated exposure to this specific pattern of rewards the rats eventually 
learn to run faster when big rewards are upcoming and slower when smaller 
rewards await them. We will consider exactly what it is that the rats learn in 
Chapter 10. For now it is enough to realize that some kind of serial learning is 
occurring. 
Paired-Associate Learning

Paired-associate (PA) learning is another of the many types of task available to 
the learning psychologist. In this task subjects must learn to associate, or pair, two 
items. In a typical human experiment the subjects will again be seated in front of a 
monitor. A stimulus item (for example, a word or nonsense syllable) will appear 
on the left of the screen for a couple of seconds. Then a response item will appear 
on the right. These two words will be paired throughout the experiment. Then the 
screen will be cleared, and a second pair composed of two different words will be 
displayed, first the stimulus word on the left and then the response word on the
right. The subjects’ task is to recall and say aloud the correct response word each 
time they see a stimulus word. Typically they have about 2 seconds to try to recall 
each response word before it is displayed by the computer. Lists vary in 
length, and an average one contains 8 or 10 pairs. Table 1.2 contains a typical 
paired-associate list. The subjects are usually asked to respond to the list until 
they have mastered it completely (that is, until they can recall correctly each 
response item given the stimulus items). Sometimes, depending on the experimenter’s
purpose, all subjects are given a set number of trials rather than asked to
master the list completely. 

The pairs in a PA list are almost always rearranged from trial to trial, for a very 
simple reason. If the order of the items were kept constant, the subject could learn 
the responses in a serial fashion and totally ignore the stimulus components. We 
would not know whether the subject was learning a serial list composed of the 
response units or a PA list composed of both stimulus and response units. 
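
The trial-to-trial rearrangement can be illustrated with a short Python sketch. It assumes that the syllables and words of Table 1.2 are paired row by row, and the function name and the fixed random seed are hypothetical choices made only for the example. Note that each stimulus stays with its own response; only the order of the pairs changes.

```python
import random


def pa_trial_order(pairs, rng):
    """Return the stimulus-response pairs in a fresh random order."""
    order = list(pairs)   # copy, so the master list is left untouched
    rng.shuffle(order)    # the pairs move; each pair stays intact
    return order


# Four pairs from Table 1.2 (row-by-row pairing is assumed).
pairs = [("TUM", "SINCERE"), ("JAK", "POTENT"),
         ("BAV", "FURTHER"), ("GOG", "BASHFUL")]

rng = random.Random(0)    # seeded only so the example is repeatable
for trial in range(1, 4):
    print(trial, pa_trial_order(pairs, rng))
```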

Our life is filled with learning situations that have something in common with 
PA learning. Associating names with faces, telephone numbers with people, and 
Spanish words with English equivalents all seem to approximate PA learning. The 
entire process of giving verbal meaning to the world through labeling seems to be 
related to the laboratory paradigm. And animals, too, must learn to associate 
specific responses with specific stimuli. 

Free Recall 

Another interesting task has been used with increasing frequency in recent years. 
In a free recall task subjects are given a list of verbal items and are then asked to 
recall the items in any order. Since the subjects are free to recall the items in any 
order, this task enables us to learn about what they do with, or how they 
organize, the materials that they are trying to learn. From a long list of randomly 
presented nouns, for example, a subject might first recall all the nouns that had 
something to do with people, then move on to all the nouns that had something to 
do with animals, and so on. The subjects cluster responses according to categories.
It seems that recalling the nouns by category makes the job easier. Not
surprisingly, this suggests that some complicated organizational processes are 
going on inside the subjects. The materials are not spit out in the same order they 
were fed in. The subjects actively process the materials, seeking ways to make the 
task easier. 
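
One simple way to put a number on clustering is to count how often two items recalled one after the other belong to the same category. The Python sketch below does this for a made-up six-noun list. The nouns, their categories, and the function name are hypothetical, and the count is offered as an illustration rather than as the standard measure used in the literature.

```python
def category_repetitions(item_order, category_of):
    """Count adjacent items in a sequence that share a category."""
    repetitions = 0
    for first, second in zip(item_order, item_order[1:]):
        if category_of[first] == category_of[second]:
            repetitions += 1
    return repetitions


# Hypothetical materials: three "people" nouns and three "animal" nouns.
category_of = {"doctor": "people", "uncle": "people", "nurse": "people",
               "horse": "animals", "wolf": "animals", "robin": "animals"}
presented = ["doctor", "horse", "uncle", "wolf", "nurse", "robin"]
recalled = ["doctor", "uncle", "nurse", "horse", "robin", "wolf"]

print(category_repetitions(presented, category_of))  # order shown to the subject
print(category_repetitions(recalled, category_of))   # order the subject produced
```

If the count for the recall order is higher than the count for the presentation order, the subject has reorganized the materials by category.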

If someone asks us to recall all our known blood relatives, we may search
through some hierarchical scheme. We might start with our maternal grandparents
and trace down through the generations. Or we might try to recall all cousins
first, then uncles, and so on. But we would probably not recall the materials
randomly; we would utilize some scheme, structure, or outline to make the search
and retrieval processes easier and more efficient. Free recall is one of the tasks that
enables us to peek into these enormously complicated and individualized organizational
activities.

Table 1.2 A Typical Paired-Associate List Involving Syllables as Stimuli and
Words as Responses

STIMULI    RESPONSES

TUM        SINCERE
JAK        POTENT
BAV        FURTHER
GOG        BASHFUL
WIF        MIGRANT
DEX        IMMENSE
LOH        ACUTE
PEC        GALLANT


Lexical Priming 

A great deal of attention has been given recently to the study of lexical memory. A 
lexicon is an ordered set of words (such as a dictionary). Thus, roughly speaking, 
your lexicon is your vocabulary. When psychologists study lexical memory, they 
are interested in how we store, organize, and utilize the words in our vocabulary. 
One of the techniques widely used to study lexical memory is called priming in 
lexical decisions (Henik, Friedrich, & Kellogg, 1983; Huttenlocher & Kubieck,
1983; Kiger & Glass, 1983). The basic procedure is this: Subjects are shown, for a
split second, an item such as canary and asked to make a rapid decision about it,
such as whether it is a word or what the word is. But before they are shown
canary, they are shown a “prime,” or a “probe.” This prime can be related to
canary (such as bird), or it can be unrelated to canary (such as tar). It has been
found repeatedly that decisions about the target canary are made faster and more
accurately if the prime is closely related to the target. It is as though the prime
activates or sensitizes the target such that it is more readily processed. As we shall 
see in Chapter 13, priming in lexical-decision situations is one of the basic tools 
used to study the structure of our memory stores. 

Priming differs from the tasks we have been discussing in an interesting and 
important way. Specifically, priming measures or reflects learning that occurred 
before the experiment began (that is, your learned vocabulary), whereas the other 
tasks usually involve new learning that occurs within the experimental session. 
More will be said about this distinction later. 

Concept Formation 

A concept is a symbol that stands for a class of objects or events that have 
common properties (Houston, Bee, & Rimm, 1983). Thus, cow is a concept 
because it stands for a large number of different objects, each of which possesses 
some characteristics in common with all the others. Boat, towel, and ant are
concepts because they stand for large numbers of individual objects that,
although differing in some ways, possess certain identifiable common characteristics.
In fact, with a few exceptions all words are concepts, because words stand
for large groups of objects or events that have something in common. Proper 
nouns are among the words that do not appear to be concepts, because they refer 
only to a single thing (for example, Mars, George Washington). 

Although most words are concepts, not all concepts are words. Concepts can 
be nonverbal. For example, infants can possess a concept of dog or adult long
before they have acquired any language at all. Similarly, animals, which are 
nonverbal, can learn concepts too. Herrnstein (1979) reports a study in which 
pigeons were able to form a concept of tree by responding to pictures of trees and 
other objects. 

As we shall see in a later chapter, concepts are enormously important in our 
thinking. They can simplify our thought processes. For example, they free us from 
the task of having to label each and every new object or event we encounter. We 
can normally fit new objects and events into existing categories. When we see a 
new house, for example, we do not have to give that specific object its own unique 
name or label; we just think, “Oh, that’s one more house.” 

Concepts are related to one another in complicated ways. For example, they 
can encompass one another. Flower is a concept. But flower is also an instance of 
the concept plant, which, in turn, is an example of the concept living thing. And 
living thing is an instance of the concept object. In other words, concepts are 
related in complex ways that seem to help us think about and understand the 
world around us. 
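
The way one concept can encompass another may be pictured as a chain of "is an instance of" links. The following Python sketch stores the flower example as a small lookup table and walks up the chain. The table and the function name are hypothetical, and nothing here is meant as a claim about how concepts are actually stored in memory.

```python
# Each concept points to the broader concept that encompasses it.
IS_A = {
    "flower": "plant",
    "plant": "living thing",
    "living thing": "object",
}


def is_instance_of(concept, category):
    """Follow the chain upward to see whether category encompasses concept."""
    current = concept
    while current in IS_A:
        current = IS_A[current]
        if current == category:
            return True
    return False


print(is_instance_of("flower", "living thing"))
print(is_instance_of("plant", "flower"))
```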

Because concepts are so important in thinking, psychologists have been interested
in studying them. They have devised sophisticated experimental methods
that help us understand how concepts are formed, or learned, and how they are 
used. Psychologists, as we shall see in Chapter 12, have also developed a number 
of theories about how concept formation progresses. For now, it is sufficient to 
understand that concept formation is one of the important and complex forms of 
learning that will be explored in this text. 



Figure 1.7 Problem solving. The problem is to overcome, or outflank, the obstacles by 
making the correct response. 
Box 1.1 Sample Problems (Answers in Box 1.2)

1. By moving only two matches make four boxes out of these five boxes (using all 



2. Complete this meaningful sequence. 

O T T F F S S _ N 

3. You are on the way to the city and you meet a man at a fork in the road. You 
do not know which fork to take. You know that the man is one of two brothers 
who live in the area. One of these brothers always tells the truth, whereas the 
other never tells the truth. You don’t know which one you are facing. You can 
ask him one question, which he can answer with a yes or no, which will assure 
you of taking the correct fork. What is that question? 

4. Nine steel balls all look alike. Eight of the balls weigh the same amount, 
whereas one is heavier than the rest. You are given a pan balance and allowed 
two weighings of any combination or number of these balls to discover which 
one is the heavy one. 

5. • • •
   • • •
   • • •

Without lifting your pencil from the paper, draw four straight lines through all 
nine points. 

6. What is the longest word you can think of that spells the same thing forward 
and backward? 

7. Vacuum is an English word containing a double U sequence. Can you think of 
another one? 


Problem Solving 

Problems refer to situations in which the organism is motivated to reach some 
goal but is blocked from attaining it by some obstacle or obstacles (see Figure 
1.7). Problems come in all shapes and sizes. Some are complex, long-term issues, 
such as the problem of trying to become an attorney, whereas others are short 
term and simple, such as the problem of trying to open the refrigerator door. The 
obstacles may be physical, social, or emotional; they may be real or imagined; 
they may be many or few. 

Box 1.1 contains some sample problems, the answers to which are given on the 
next page. Give them a try. Once you have tried to solve these problems, you will 
be able to reflect on the complexity of the mental activities associated with their 
solution. It is interesting that one of the learning situations concerning us most in
the real world (problem solving) is also one of the most complicated and difficult
to study. There is little doubt that we would rather understand problem solving
than paired-associate learning. Yet the PA task seems to receive more attention in
the field of learning than does the more complicated, relevant problem-solving
situation. It is as though psychologists hoped that an eventual understanding of
the problem-solving situation would follow, or evolve, from a thorough understanding
of more limited, simplified learning situations. As we shall learn, however,
problem solving remains poorly understood.

Box 1.2 Solutions to the Problems in Box 1.1

2. O T T F F S S E N

These are the first letters of the numbers one through nine.

3. Point to either fork and ask the man if his brother would say this was the
correct fork. If he says “No,” then it is the correct fork. If he says “Yes,” it is the
wrong fork.

4. Weigh any three against any other three. If one group of three is heavier than
the other, then it contains the heavy ball. If not, then the remaining unweighed
group of three contains the heavy ball. To determine which of three balls is the
heavy one, weigh any one against any other one.

6. POP (so so)
OTTO (not bad)
MADAM (very good)
RADAR (also very good)
REDIVIDER (excellent)

7. Continuum
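
The two-weighing solution to problem 4 in Box 1.2 is a small algorithm, and it can be checked with a short Python sketch. The pan balance is simulated here by comparing summed weights, which is an assumption of the example, and the function names are hypothetical. Each call to the helper stands for one weighing.

```python
def find_heavy_ball(weights):
    """Locate the one heavier ball among nine using two weighings."""
    assert len(weights) == 9

    def heavier_group(left, right, aside):
        # One use of the pan balance: left pan against right pan.
        left_total = sum(weights[i] for i in left)
        right_total = sum(weights[i] for i in right)
        if left_total > right_total:
            return left
        if right_total > left_total:
            return right
        return aside  # the pans balance, so the heavy ball was set aside

    # Weighing 1: three balls against three, with three left unweighed.
    trio = heavier_group([0, 1, 2], [3, 4, 5], [6, 7, 8])
    # Weighing 2: one ball against one, with one left unweighed.
    (index,) = heavier_group([trio[0]], [trio[1]], [trio[2]])
    return index


# Try every possible position for the heavy ball.
for heavy in range(9):
    balls = [1] * 9
    balls[heavy] = 2
    print(heavy, find_heavy_ball(balls))
```

Each weighing has three possible outcomes (left heavy, right heavy, or balanced), so two weighings can separate nine possibilities, which is why nine balls is the limit for this procedure.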

Syllables, Words, and Prose 

Psychologists use a variety of materials in the human learning tasks that we have 
been discussing. Although words are often employed, they are not the only verbal 
unit utilized. As we have seen, nonsense syllables are often employed. Typically, a 
nonsense syllable is a three-letter sequence, such as ZXB or JAX, which is 
presumed to possess little meaning. (CCCs are consonant-consonant-consonant 
sequences, and CVCs are consonant-vowel-consonant sequences.) The idea
behind their development was that, if we could utilize verbal units that were
relatively free from preexperimentally established meaning, then we would be in a
better position to study pure, new learning unencumbered by previously established
habits. Unfortunately, nonsense syllables are never really devoid of meaning.
For instance, VUL, NAP, and even REZ probably elicit some kind of
responses as you read over them. It may be impossible to come up with a truly
nonsensical syllable. Partially as a result of this realization, psychologists have
recently displayed a turn away from nonsense material. In fact, it is fair to say that
recent years have seen a move toward the study of more meaningful rather than
less meaningful materials. Thus, as we shall see, the literature is now filled with
studies of memory for sentences, prose, connected discourse, and language in
general (Schmidt, 1983; Stanovich & West, 1983a, 1983b; Waters, 1983).

Table 1.3 The Involvement of Learning in Psychology

AREA OF PSYCHOLOGY: SAMPLE CONCERNS INVOLVING LEARNING

Physiological: What chemical changes in the brain account for learning
and memory? What drugs will affect memory?

Perception: Do we learn to perceive depth, or is our depth perception
innate? How does inaccurate perception affect learning?

Developmental: How do children learn sex roles? Are there certain times,
or stages, when a child is ready to learn certain things such
as reading?

Measurement and testing: Can we teach children to do better on intelligence tests?
Do intelligence tests measure how much one has learned,
how bright one is, or both?

Motivation: Will the urge to satisfy our curiosity help us learn? How
do we acquire the need to achieve?

Emotion: Are emotions learned? Will strong emotions such as fear
and anger help us learn, or hinder our learning?

Personality: Can our personality traits be modified by learning? Is
personality learned?

Abnormal and clinical: Is an abnormal fear of horses explainable in terms of
learning? Is schizophrenia a learned reaction? How can the
principles of learning help reduce suffering due to mental
illness?

Social: Are attitudes learned? Can we learn to be independent of
group pressure? Will an audience help or hinder learning?

Community: How do we teach families and friends to be more
supportive of people in trouble? Can people having
difficulties learn to seek support from the community?

Environmental: Can learning be improved by varying the environment?
What is the relationship between crowding and learning?

Industrial: Can employees and employers learn to get along with one
another? How do we best teach people to operate
machinery in an efficient manner?

Adapted from Houston, J. P., Bee, H., and Rimm, D. C. Invitation to psychology (2nd ed.). New York: Academic Press, 1983.
Table 1, p. 179.

THE IMPORTANCE OF LEARNING STUDIES 
As a fitting close to this chapter it is appropriate to underscore the fact that 
learning appears to be crucial in most, if not all, areas of psychology. As you can 
see in Table 1.3, psychologists working in all the major fields of the science are 
concerned with problems involving learning. Without a firm understanding of
learning these intriguing questions could not be answered. 

SUMMARY 

Learning can be defined in many ways. According to our sample definition, 
learning refers to a relatively permanent change in behavior potentiality that 
occurs as a result of reinforced practice. In this definition, changes in behavior 
resulting from motivational fluctuations, maturation, and various physical and 
physiological factors are excluded from consideration. Learning must be distinguished
from performance, which is the translation of learning into behavior.
Reinforcement, under our definition, is necessary for learning to occur. 

A second effort to bring the field of learning into focus involves examining 
examples of learning tasks employed by psychologists in their laboratories. These 
include classical conditioning, instrumental conditioning, discrimination learning,
serial learning, paired-associate learning, free recall, priming in lexical decisions,
concept formation, and problem solving. Two points should be kept firmly
in mind. First, these are only a few of the many learning paradigms employed by 
psychologists. Second, psychologists are uncertain about whether these tasks 
represent many different, distinct types of learning or merely different varieties of 
some common underlying process. 
INTRODUCTION

This chapter is devoted to a more detailed consideration of classical and instrumental
conditioning. These topics deserve special consideration because more
has been done with these two learning situations than with most of the others.
Classical conditioning will be considered first, and examples, procedures, and 
basic phenomena will be presented. The same kind of consideration then will be 
given to instrumental conditioning. 

Students are sometimes confused by the terms operant conditioning and respondent
conditioning. For our purposes, operant conditioning is the same as
instrumental conditioning, and respondent conditioning is identical to classical
conditioning. The terms operant and respondent were developed by B. F. Skinner
and are used by a large number of psychologists. Although some investigators
attempt to distinguish between operant and instrumental conditioning and between
respondent and classical conditioning (Ellis, 1972; Hall, 1982), this text
does not. 


CLASSICAL CONDITIONING 
Examples 

Classical conditioning is not limited to situations in which dogs salivate and bells 
ring. If it were, we would not give it a second thought. The phenomenon is, in 
fact, widespread. It occurs across a large number of animals, responses, and 
situations. Let us look at a few selected examples, which should provide some 
idea of the diversity of classical conditioning situations. 

EYE BLINK CONDITIONING. In one of the most widely used classical conditioning 
situations an eye blink is conditioned to some neutral stimulus, such as a sound or 
a light, that does not initially elicit a response (Kehoe, 1983). In these situations a 
puff of air (the UCS) is directed toward the subject’s eyeball. As one would 
imagine, the air puff causes the subject, often a rabbit, to blink (the UCR). At or 
very nearly at the time the air hits the eyeball, a CS, such as a bell, tone, or light, is 
presented. Successive pairings of the UCS (air) and the CS (bell, tone, or light) 
eventually lead to the situation in which the CS alone is capable of eliciting a blink 
(the CR) (see Figure 2.1). 

AVERSIVE CONDITIONING USING SICKNESS AS THE UCR. In another interesting 
example Garcia, McGowan, and Green (1972) describe conditions in which the 
drinking of a saccharin solution (CS) was paired with X irradiation (UCS), which 
makes rats sick. Later the subject rats showed a definite aversion to the taste of 
saccharin. The notion here is that the taste of saccharin (CS) produced a CR 
composed of an unpleasant sensation similar to the original UCR (sickness). If 
you have been thoroughly inebriated, you know that the sight of a liquor bottle 
can make you feel ill all over again. 

More will be said in Chapter 4 about taste-aversion learning, as this form of 
conditioning is often called. For now it is sufficient to realize that this is one of the 
most widely and intensely studied kinds of conditioning (Dawley, 1979; Klunder 
& O’Boyle, 1979; Kurz & Levitsky, 1983; Monroe & Barker, 1979; Peters,
1983). 
Figure 2.1 Diagram of the rabbit eyeblink conditioning preparation. A puff of air
directed at the eye or a mild shock to the skin below the eye serves as the UCS. Eyeblinks 
are detected by a potentiometer. (From Domjan, M., and Burkhard, B. The principles of 
learning and behavior. Copyright © 1982 by Wadsworth Inc. Reprinted by permission of 
the publisher, Brooks/Cole Publishing Company, Monterey, California.) 

SEMANTIC CONDITIONING. Words and language can be involved in classical 
conditioning as well as the more physiological responses that we have been 
discussing. For example, there is a sizable literature associated with semantic 
classical conditioning (Maltzman, 1977). In a typical semantic conditioning situation,
a series of words might be presented to the subject. Some of these words are
paired with a mild shock, and some are not. Then in the second phase of the
experiment words that are similar to the words in the original list are presented
without shock. It is found that words that are similar to the original shock-associated
words elicit the same kind of fear and nervousness that the shock
elicits. Second-phase words that are similar to the nonshock original words elicit 
no fear and no anxiety. In other words, verbal elements can take on meaning 
through a classical conditioning process. Whatever it is that our real mother 
elicits in us can be conditioned to the word mother. Whatever it is that large, 
threatening dogs elicit in us can be classically conditioned to the words mad dog. 
Words can “take on” emotional meaning through classical conditioning. 

MASOCHISM. Masochism refers to a pattern of behavior in which pleasure 
appears to be derived from pain. Although there are many psychodynamic 
interpretations of masochism, the phenomenon can also be interpreted as an 
instance of classical conditioning. 

Pavlov (1927) first used a very mild shock as a CS in a salivary conditioning 
situation. Because the shock was so minimal, the dogs had no fear of it, nor did 
they hesitate to jump eagerly into the conditioning harness. The shock merely
served as a cue, or sign, that meat powder was to follow. Gradually over a series 
of days Pavlov increased the intensity of the CS shock. Ultimately he was able to 
use a fairly powerful shock as the CS. Yet the dogs acted as though they did not 
mind the shock at all. They continued to be eager to get on with the experiment. 
The painful shock had become a cue for pleasure (meat powder presentation). 
Had the dogs received the strong shock on their first trial, they would have 
howled. Through this training procedure, however, pain somehow had become a 
sign for pleasure.

Human masochistic behavior may be similar. Through the masochist’s history 
pain may become pleasant in the sense that it becomes associated with, or predicts 
the occurrence of, pleasure. For example, a young boy may need the companionship
of his father, but the only way he can get it is to engage in painful
roughhousing with him. Therefore, the boy suffers the pain to gain the social 
contact he needs. Over time the pain itself might become mildly pleasant because 
of its association with intense social gratification. 

PHOBIAS. Phobias are powerful but irrational fears that affect our life adversely. 
For example, one individual might be terrified of heights, germs, or being outdoors.
Another might be terrified by being indoors. Table 2.1 contains some
common phobias and their names. Many of these debilitating fears may involve 
classical conditioning. If a boy saw his family being burned to death, for example, 
he might develop a powerful fear of fire through classical conditioning. The UCS 
would be the death of the family. The UCR would be terror. The CS would be fire, 
and the CR would be the fear response connected to and elicited by the fire. In 
Chapter 3 we will learn how many of these phobias can be eliminated through 
our understanding of classical conditioning. 

ADDITIONAL EXAMPLES. Scores of animals and responses have been involved in 
classical conditioning. Chick embryos (Hunt, 1949), planaria (Thompson & 
McConnell, 1955), and human fetuses (Spelt, 1948) have undergone conditioning.
Heart rates (Black, 1965; Hall, 1976) and pupillary dilation (Gerall, Sampson,
& Boslo, 1957; Goldwater, 1972) have been classically conditioned. And
Bitterman, Menzel, Fietz, and Schafer (1983) have successfully conditioned proboscis
extension in honeybees (Apis mellifera). The point here is that classical
conditioning is not restricted to the Pavlovian situation. It is pervasive in the 
animal kingdom, and it is pervasive in human life. 

Temporal Spacing of CS and UCS 

We already know that classical conditioning will occur when the CS and the UCS 
occur simultaneously and when the CS precedes the UCS. But what about when 
the CS follows the UCS? Will classical conditioning occur then? If your instructor 
walked into your classroom, fired a pistol, and then shouted “Boo!” would 
conditioning occur? If it did at all, the conditioned response to the CS would be 
very weak. Thus, we can see that the temporal spacing of the CS and the UCS is 
critical in classical conditioning. Names have been given to the various temporal 
patterns. 
Table 2.1 Some Common Phobias

NAME OF PHOBIA     FEARED STIMULUS

Acrophobia         Height
Agoraphobia        Open spaces
Ailurophobia       Cats
Anthophobia        Flowers
Anthropophobia     People
Aquaphobia         Water
Astraphobia        Lightning
Brontophobia       Thunder
Claustrophobia     Closed spaces
Cynophobia         Dogs
Equinophobia       Horses
Herpetophobia      Reptiles
Mysophobia         Contamination, dirt
Nyctophobia        Darkness, night
Ophidiophobia      Snakes
Pyrophobia         Fire


DELAYED CLASSICAL CONDITIONING. In delayed procedures the onset of the CS 
always precedes the onset of the UCS. The CS is then left on at least until the 
beginning of the UCS. It may then be terminated or continued into, and even 
beyond the duration of, the UCS. 

SIMULTANEOUS CLASSICAL CONDITIONING. In simultaneous conditioning the 
onset of the CS occurs at the same time as the onset of the UCS. In addition, the 
two stimuli are usually terminated together. 

TRACE CLASSICAL CONDITIONING. In trace conditioning the CS is presented and 
terminated before UCS onset. 

BACKWARD CLASSICAL CONDITIONING. In backward conditioning the CS fol¬ 
lows the UCS. Generally speaking, very poor conditioning occurs with this 
pattern. 

Figure 2.2 is a graphic representation of these four types of classical
conditioning. The delayed procedures are best, and the backward procedures are least
effective (see Keith-Lucas & Guttman, 1975; Sherman, 1978). The simultaneous 
and trace procedures tend to be of intermediate effectiveness. 
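
The four definitions above depend only on the onset and offset times of the two stimuli, so they can be written as a short decision rule. The following Python sketch is an illustration only: the names Trial and classify_procedure are hypothetical, and times are assumed to be measured in seconds from an arbitrary starting point.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Trial:
    """Onset and offset times (in seconds) of the CS and the UCS on one trial."""
    cs_on: float
    cs_off: float
    ucs_on: float
    ucs_off: float


def classify_procedure(trial: Trial) -> str:
    """Name the temporal pattern of a single CS-UCS pairing."""
    if trial.cs_on == trial.ucs_on:
        return "simultaneous"      # both stimuli begin together
    if trial.cs_on > trial.ucs_on:
        return "backward"          # the CS follows the UCS
    # From here on, CS onset precedes UCS onset.
    if trial.cs_off < trial.ucs_on:
        return "trace"             # the CS ends before the UCS begins
    return "delayed"               # the CS stays on at least until UCS onset


# A CS that comes on at 0 s and stays on until the UCS begins at 0.5 s.
print(classify_procedure(Trial(0.0, 0.5, 0.5, 0.6)))  # delayed
```

The rule checks onsets first because simultaneous and backward pairings are defined by onset order alone; CS offset matters only for separating trace from delayed procedures.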

TEMPORAL CLASSICAL CONDITIONING. One final form of classical conditioning 
must be mentioned. In temporal conditioning the UCS is presented at absolutely 
regular intervals; no obvious CS is used. Conditioning still occurs, however, 
because the regular time interval becomes the CS. It is the end of this regular 
interval (say, 10 seconds) that is consistently paired with the UCS and becomes 
the effective CS. 

Figure 2.2 Temporal arrangements of the CS and the UCS used in four conditioning
procedures. The upward movement of a line represents the onset of a stimulus, and the 
downward movement of a line represents the offset of a stimulus. 


How Does One Observe a CR? Test Trials and Inhibition of Delay

Let us assume that your instructor has given you several delayed classical
conditioning trials using a pistol and shouting “Boo!” Of course, the class is startled
and alarmed each time the stimuli are presented. But is the startle response a CR 
or a UCR? Because both the CS and UCS are presented, we cannot be sure. The 
observed startle response may be a mixed response, possessing both learned and 
unlearned components. A good way to differentiate a CR from a UCR is to leave 
out the UCS on a given trial. Any response that then occurs on such a test trial 
must be a CR. 

There is a problem with this procedure for observing a CR. If we leave out the 
UCS on a given trial, we have instituted what is called an extinction trial. In 
classical conditioning, extinction refers to the fact that presentation of the CS 
alone, without the UCS, will lead to a cessation of the response. That is, your 
instructor cannot put the pistol away, say “Boo!” every day for the rest of the 
semester, and expect you to respond each time. You would extinguish, or stop 
responding, after a time. If we try to observe a CR by occasionally leaving out the 
UCS, we are slipping extinction trials into the learning situation. We are diluting 
the learning situation and will obtain a false picture of the increasing strength of 
the CR. It is a dilemma. We want to know if CRs are occurring, but we do not 
want to leave out the UCS for fear of partially extinguishing, or weakening, 
whatever CR strength we already have. The more we leave out the UCS, the more 
checks we have on the developing CR. But an increase in the number of such 
checks also increases the number of unwanted extinction trials. 

Fortunately, nature comes to the rescue. As it turns out, after delayed training 
has progressed for a while, the CR begins to precede the onset of the UCS. That 
is, the CR will occur, or begin, after the onset of the CS but before the onset of the 
UCS. In our example the instructor would call out “Boo!” Then, before the gun 
went off, the class’s conditioned startle response would begin. The CR occurs in 
anticipation of the UCS. Thus, in delayed procedures there is no need to leave out 
the UCS, because the CR begins in that little delay between the CS and the UCS. If
the experimenters observe a response occurring before the onset of the UCS, they 
call it a CR. It could not be a UCR because, by definition, a UCR is a response to 
the UCS. The fact that the CR eventually just precedes UCS onset is sometimes 
called inhibition of delay. 

Inhibition of delay helps us when we want to observe CRs in delayed classical 
conditioning, but it does not help us in simultaneous procedures. Of course, it 
does not even occur in simultaneous situations, where both CRs and UCRs begin
after the onset of both stimuli. Thus, to observe the CR in a simultaneous
classical conditioning paradigm, test trials must be given, during which the UCS is
omitted. 


Is the CR the Same as the UCR? 

An eye blink is an eye blink, right? Perhaps not. Classical conditioning has 
sometimes been thought of as the simple transfer of a given response from one 
stimulus (UCS) to another (CS), but additional research has revealed that the CR 
is not identical to the UCR (see Kimble, 1961). For example, the UCR tends to be 
a more potent, “larger” response than the CR. Thus, an unconditioned eye blink 
will tend to involve a bigger movement of the eyelid than a conditioned eye blink 
and will last longer. In addition, UCRs tend to occur faster than CRs in response 
to their initiating stimuli. Specifically, the UCS-UCR interval tends to be shorter 
than the CS-CR interval.

THE CR AS A COMPONENT OF THE UCR. There are at least two views of the nature 
of the CR. One view holds that the CR is a component of the UCR. In the salivary 
conditioning situation, for example, the UCS (meat powder) elicits a very
complicated pattern of reactions, including salivation, tongue smacking, licking, and the
like, but the CS (tone) elicits only some of these. The same effect appears in other 
conditioning situations. When your instructor shouts “Boo!” without firing the 
pistol, for example, you will experience a startle reaction, but it may not be as 
complete or dramatic as the one that follows the actual firing of the gun. 

THE CR AS A PREPARATORY RESPONSE. A second view holds that the CR is a 
response that prepares the animal for the UCS. In delayed eye blink conditioning 
the CR may begin after the onset of the CS but before UCS onset. It is as though 
the CR is preparing the animal for the UCS (air puff). One way to think about it is 
that the CR may actually allow the animal partially to avoid the noxious air puff. 

These two views, the preparatory and the component positions, are not 
mutually exclusive. The CR may well be a partial response and, at the same time, 
prepare the animal for the UCS. 

Responses Beyond the CR and UCR 

Another interesting discovery is that in delayed eye blink situations many
different kinds of blinks are occurring, not just CRs and UCRs. For example, random
blinks often occur, and they are a potential source of confusion in conditioning 
experiments. If the subject happens to blink randomly just before the onset of the 
UCS, there is a danger that the experimenter will call that blink a CR. In addition,
some subjects are more than helpful in conditioning situations; they blink
voluntarily. The experimenter does not want to count these voluntary responses as true
CRs. Fortunately, voluntary responses tend to begin earlier in the CS-UCS
interval than true CRs and to last longer. But they are a problem. Finally, there is
the alpha response, a small blink that occurs immediately after the onset of the
CS. It can easily be confused with a true CR even though it occurs earlier in the
CS-UCS interval and is smaller than the typical CR. The alpha response is a UCR
to the CS, or a small startle response. The bell alone occasionally causes the 
subject to blink a little. The alpha response is not a learned response but is a reflex 
response to the CS. 
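
The timing distinctions just described can be turned into a rough first-pass sorting rule. The Python sketch below is illustrative only: the function classify_blink and the cutoff alpha_window are hypothetical and are not taken from any of the studies cited. Latency alone cannot separate random or voluntary blinks from true CRs, so the label "candidate CR" is deliberately tentative.

```python
def classify_blink(onset: float, cs_on: float, ucs_on: float,
                   alpha_window: float = 0.1) -> str:
    """Give a first-pass label to a blink from its onset time (in seconds).

    alpha_window is an assumed cutoff: blinks beginning within this many
    seconds of CS onset are flagged as possible alpha responses.
    """
    if onset < cs_on:
        return "random blink (before the CS)"
    if onset >= ucs_on:
        return "possible UCR"            # begins after the UCS has arrived
    if onset - cs_on <= alpha_window:
        return "possible alpha response"  # immediately after CS onset
    return "candidate CR"                 # later in the CS-UCS interval


# CS at 0 s, UCS at 0.5 s, blink beginning at 0.4 s.
print(classify_blink(0.4, cs_on=0.0, ucs_on=0.5))  # candidate CR
```

In practice an experimenter would also want the size and duration of each blink, since the text notes that alpha responses are smaller than CRs and voluntary responses last longer.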

We can begin to see that classical conditioning is quite complicated (see 
Holland & Rescorla, 1975). What began as a simple situation, in which one 
response was supposed to be transferred from one stimulus to another, turns out 
to involve many different responses. (For deeper excursions into the nature of 
classical conditioning you are referred to Wickens, 1984, and Wickens, Tuber & 
Wickens, 1983.) 

Some Selected Phenomena and Areas of Research 
in Classical Conditioning 

PSEUDOCONDITIONING. Pseudoconditioning is an interesting phenomenon in 
which the UCS is first presented alone for a series of trials. Then the CS is 
presented alone. On this very first presentation of the CS, which has never been 
paired with the UCS, something resembling a CR is elicited. For example, 50 
successive solitary puffs of air may be directed toward the subject’s eye. Then a 
bell may be sounded by itself. The subject will blink. The effect is called
pseudoconditioning because the CS and the UCS have never been paired. The argument
is that there has been no opportunity for learning to occur. Pseudoconditioning 
has been observed in humans (Prokasy, Hall, & Fawcett, 1962), cats (Harlow & 
Toltzein, 1940), goldfish (Harlow, 1939), and many other species (see Razran, 
1971). 

Pseudoconditioning can be demonstrated in the classroom. Suppose that your 
instructor fires a blank pistol on 20 successive mornings without ever saying 
“Boo!” Then on the 21st day the instructor says “Boo!” instead of firing the gun. 
Pseudoconditioning would be demonstrated if you, as a subject, reacted to the 
verbalization in a manner similar to your response to the report of the gun. 

There have been several different interpretations of pseudoconditioning. One 
of them argues that the response to the CS is a result of generalized excitement. 
The implication is that after a series of UCS presentations we are so excited and 
ready to respond that we will respond to anything, even a neutral stimulus such as 
a tone. Another interpretation argues that pseudoconditioning may actually be 
true classical conditioning and not “pseudo” at all. How could that be? Where is 
the pairing of a CS and a UCS that is essential to the classical conditioning 
paradigm? The trick is to find the CS. We are looking for something common to 
the presentation of the UCS alone (air puff) and to the presentation of the CS 
alone (bell). One possibility is a change from nothing to something, or a change 
from no stimulation to stimulation. This change is common to the presentation of 
the air puff and the bell. During the series of trials when the UCS is being 
presented, our newly discovered CS (change from no stimulation to stimulation) 
is present. It is “paired” with the air puff. Then when the bell alone is presented,
our subtle CS (change from no stimulation to stimulation) is also present (see
Wickens & Wickens, 1942). Thus, pseudoconditioning may actually be true
classical conditioning in which the CS is a subtle energy change.

Figure 2.3 Relationship between conditioning and the CS-UCS interval. Note that the
maximum frequency of conditioned responses occurs when the CS-UCS interval is
approximately .5 second. (Adapted from Spooner, A., and Kellogg, W. N. The backward
conditioning curve. American Journal of Psychology, 1947, 60, 321-334. Fig. 2, p. 327.
© 1947, 1975 by the Board of Trustees of the University of Illinois. Reprinted by
permission of the University of Illinois Press.)

THE CS-UCS INTERVAL. The amount of time between CS onset and the onset of 
the UCS has long been one of the major concerns in the field of classical 
conditioning. Generally speaking, a CS-UCS interval of about .5 of a second has 
proved optimal in many situations. Although the data suggest that this is true, the 
reasons for it are not at all clear. Studies that have varied the CS-UCS interval
suggest that either longer or shorter intervals are less effective in the development
of conditioned responses (see Figure 2.3) (Kimble & Reynolds, 1967; Spooner &
Kellogg, 1947; Wolfle, 1930, 1931). However, the relationship between conditioning
and the CS-UCS interval is not a simple one. Although the optimal interval is,
in general, around .5 of a second, there are cases in which this rule does not hold.
For example, Ross & Ross (1972) present data suggesting that at least in some
situations the optimal interval is strongly affected by the complexity of the CS. In
some of their studies longer CS-UCS intervals were needed for optimal
conditioning when a complex CS was being used. Although they are controversial
and not well understood, there are additional cases in which classical conditioning
appears to occur over very long CS-UCS intervals. For example, Kehoe,
Gibbs, Garcia, & Gormezano (1979) used what is called a serial compound CS.
What this means is that two CSs (such as a bell and a buzzer), rather than just
one, were presented before the UCS was presented. In one condition the first CS
(CS1) was presented. Then, approximately 18 seconds later, the second CS (CS2)
was presented. Then CS2 was followed by the UCS. In this serial pattern it was
found that CS1 came to elicit a CR, something that would not have occurred had
CS1 been presented alone without the introduction of CS2 just before UCS onset.
Conditioning to CS1 alone would not have occurred because the CS1-UCS
interval was too long (18 seconds). Exactly why the introduction of CS2 helped
conditioning over this relatively long interval is not known, but an upcoming
section will have something to say about two possible explanations. The point
here is that classical conditioning did occur over a long (approximately
18-second) CS-UCS interval.

Surprisingly, classical conditioning can be obtained over even longer CS-UCS
intervals. In the bait-shyness effect, discussed in detail in Chapter 4, conditioning
appears to be demonstrable even when the interval is one of hours rather than
minutes or seconds (see Dickinson & Mackintosh, 1978). The relationship between
conditioning and the CS-UCS interval is far from simple and is not completely
understood at the present time. 

CS AND UCS INTENSITY. The intensity of the CS and of the UCS also seems to be 
important in classical conditioning. In general, more intense CSs and UCSs will lead 
to faster conditioning and to a higher degree of final conditioning (see Kamin & 
Brimer, 1963; Kehoe, 1983). This relationship seems to hold true over quite a broad 
range of stimulus intensities. If CS or UCS intensity is too high, however,
conditioning may be impaired. This disruption may occur because very intense stimuli
probably elicit strong unwanted unconditioned responses that mask the occurrence
of the conditioned responses that the experimenter is trying to observe (see also Grice
& Hunter, 1964).

MULTIPLE-RESPONSE MEASURES. A recent trend in the study of classical
conditioning focuses on the fact that classical conditioning involves many different,
and perhaps related, changes within the organism (Black & Prokasy, 1972). In eye 
blink experiments, for example, the experimenter has traditionally considered the 
blink to be a specific CR to a specific CS. As it turns out, the blink is not the only 
response that can be elicited by the CS. There may be important concurrent 
changes in, for example, heart rate and skeletal activity. The new trend in the field 
is to examine the relationships among these various response systems, to look at 
the actions of the whole organism in response to the CS, and to avoid looking at a 
simple CR, such as the eye blink, in isolation. The questions being asked are: How
are the different response systems related? Can changes in one be predicted from 
changes in the others? The goal is to understand the overall pattern of behaviors 
involved in classical conditioning. 

The situation is complicated further by the fact that different CSs tend to elicit 
different constellations of responses that make up the overall CR. A light used as 
a CS, for example, will tend to elicit a set of CR components that is slightly 
different from the set of responses elicited by a bell or a buzzer (Holland, 1977). 
Holland (1979), for instance, has shown that an auditory CS may be associated 
more strongly with a startle response and head jerks, whereas a visual CS may be
associated with rearing (standing) behavior in the rat. In summary, the nature of 
the CR that we obtain in classical situations is, at least in part, dependent on the 
nature of the CS. 

CS PREEXPOSURE EFFECT: LATENT INHIBITION. Repeated presentation of the CS 
before presentation of the paired CS and UCS will sometimes lead to poorer 
conditioning than if the CS is paired with the UCS from the outset. If a tone is 
sounded alone for 150 trials and is then paired with an air puff, the resulting eye 
blink conditioning will be slower than if the tone had not been presented alone. 
The phenomenon is called latent inhibition, or the CS preexposure effect. Some 
workers have felt that the preconditioning exposure to the CS reduces the
subject’s attention, or orientation, to the CS. In other words, if someone rings a bell
endlessly, we will adapt to it, or pay less attention to it, and its effectiveness as a 
cue in learning will be reduced accordingly. Much of the work on latent inhibition 
has been done in the Soviet Union, but its appeal has been spreading in the United 
States and Europe (Best, Gemberling, & Johnson, 1979; Channell & Hall, 1983; 
Dawley, 1979; Hall & Pearce, 1979; Pearce & Hall, 1979). 

UCS PREEXPOSURE EFFECT. The UCS preexposure effect (sometimes called 
adaptation or habituation) refers to the situation in which the UCS, rather than 
the CS, is presented alone for a series of trials before the CS and UCS are paired. 
When this is done, conditioning is again slowed (see Batson, 1983; Randich & 
Haggard, 1983). 

At this point one might ask about the relationship between UCS preexposure 
and pseudoconditioning. Both procedures involve the same manipulation. The 
UCS is presented alone for a series of trials. Yet in pseudoconditioning something 
like a CR is obtained on the very first presentation of the CS, whereas in 
adaptation, conditioning is slowed over a series of CS-UCS pairings. It seems that
the difference between the two phenomena may lie in the fact that the
pseudoconditioning effect appears on the very first presentation of the CS and is a transitory
effect. That is, its impact will not be observed over a long series of CS-UCS
pairings. Adaptation, in contrast, refers to a decrement that occurs over a long
series of CS-UCS pairings. Pseudoconditioning may actually occur in adaptation
experiments on the first trial or two, but it is neither strong enough nor lasting
enough to affect, or counteract, the decrement in conditioning that appears over a
long series of CS-UCS pairings.

COMPOUND STIMULUS EFFECTS. Rescorla and Wagner (1972), and others, have 
argued that classical conditioning involves competition for associative strength 
among all the stimuli present in the environment. Although the experimenter 
focuses on and manipulates a CS such as a bell, there are other stimuli in the 
situation (for example, the background stimuli) that might potentially serve as 
functional CSs. Thus, it becomes the task of the investigator to tease out and 
isolate those stimuli that are actually functioning as the CS in any given situation. 

Many recent studies have employed compound stimuli in these efforts to isolate 
and understand effective CSs. A compound stimulus is a stimulus composed of 
two or more discrete components. Thus, a compound CS might be composed of a 
bell and a buzzer or of a light, a bell, and a shock. The question becomes one of
determining which components will become effective CSs and under what
conditions. The following compounding phenomena represent some of the results that
have already been obtained in this intriguing search. 

1. Sensory preconditioning. Assume that we are dealing with two discrete 
stimulus elements—say, a bell and a light. Call the bell A and the light X. The first 
step in demonstrating sensory preconditioning is to pair A and X together 
repeatedly; then pair A with a UCS such as an air puff to the eyeball until A 
begins to elicit a conditioned blink; and finally, present X alone. If X elicits a
blink, even though it has never been paired with the air puff, then sensory
preconditioning has been demonstrated (Fudim, 1978; Pfautz, Donegan, &
Wagner, 1978).

2. Higher-order conditioning. Also known as second-order conditioning,
higher-order conditioning refers to the situation in which the CS from one learning
task is used as the UCS in a subsequent task (Cheatle & Rudy, 1978; Szakmary,
1979). For example, a bell might be paired first with meat powder in a salivary
conditioning situation. Once the bell was eliciting a conditioned salivary
response, the meat powder could be removed entirely, and the bell could be paired
with a light. Higher-order conditioning is demonstrated if the light begins to elicit
a conditioned response. In effect the bell is now the UCS, and the light is the CS.
Interestingly, some responses, such as fear, seem to be much easier to
second-order condition than others, such as the eye blink (Popik, Stern, & Frey, 1979).

3. Overshadowing. First, pair AX with a UCS. Then test for the effectiveness 
of A and X alone in eliciting the CR. If one element displays greater control over 
the CR than the other, we say that overshadowing has been demonstrated 
(Odling-Smee, 1978). The fact that the CS component manipulated by the
experimenter usually displays greater control than the background stimuli in the
environment is a form of overshadowing (see Couvillon, Klosterhalfen, &
Bitterman, 1983).

4. Compounding. Pair AX with a UCS. Then test with A, with X, and with 
AX. If a CR is elicited by the AX compound but by neither A nor X alone, then 
we say that compounding has occurred. 

5. Kamin’s blocking effect. Pair A with a UCS. Then pair AX with the same 
UCS. Then test for the effectiveness of X in eliciting a CR. In this case X will often 
be less effective in eliciting the CR than it would be if A had not initially been 
paired with the UCS. In other words, the fact that A already elicits the CR 
somehow blocks the establishment of a strong X-CR connection during the
AX-UCS pairing (Cheatle & Rudy, 1978; Haggbloom, 1983; Kamin, 1969b;
Kohler & Ayres, 1979; Rescorla & Colwill, 1983; Stickney & Donahoe, 1983). 

6. Conditioned inhibition. Also known as conditioned suppression,
conditioned inhibition refers to the following effect. On some trials A is paired with
the UCS. On other trials AX is presented without the UCS. Conditioned inhibition is
demonstrated when it is found that A elicits a CR, whereas AX is less likely to do 
so (Memmott & Reberg, 1977; Rescorla & Holland, 1977). 

7. Serial compounding. We have already seen in connection with our
discussion of the CS-UCS interval that conditioning over long intervals can occur when
two successive CSs are presented before the UCS in a CS1-CS2-UCS sequence.
This stimulus compounding effect is particularly instructive, because it points out
how many of the stimulus compounding effects that we have discussed may be
interrelated. Specifically, it has been suggested that the serial compound effect
might be explainable in terms of some of the other compounding effects (Kehoe et
al., 1979). The long CS-UCS interval might be bridged by either a sensory
preconditioning effect or by higher-order conditioning. In terms of the sensory 
preconditioning effect it would be argued that CS1 and CS2 are paired, that CS2 
is then paired with the UCS, and that CS1, because of its pairing with CS2, would 
then be capable of eliciting a CR. The higher-order conditioning explanation 
would hold that CS2, initially paired with the UCS, becomes the effective UCS 
when it is paired with CS1. Whether these explanations will hold up remains to be 
seen. But they, and all the stimulus compounding effects, do indicate the
complexity and, yet, the predictability of stimulus control effects in classical conditioning.

Informational Theories 

Recent thinking about classical conditioning has led many investigators to the 
conclusion that conditioning is most effective when the CS predicts the
occurrence of the upcoming UCS for the subject (Dickinson & Mackintosh, 1978; Frey
& Sears, 1978; Mackintosh, 1975; Rescorla & Wagner, 1972). Conditioning is 
most effective when the CS signals the fact that the UCS is impending or when it 
provides the subject with information about the very near future. 

The argument is that subjects will start blinking (CR) in response to a bell only 
when that bell (CS) lets the subjects know that an air puff (UCS) is about to hit 
their eye. If the bell does not help the subjects know when the air puff is about to 
occur—that is, if the air puff is as likely to occur in the absence of the bell as in its 
presence—then little or no conditioning will occur. It makes sense, doesn’t it? In 
fact, Rescorla and Wagner (1972) have found that, when the UCS is to occur 
equally with or without the CS, the CS will not begin to elicit a CR. But when the 
UCS is more likely to follow CS presentation than it is to occur in the absence of 
the CS, the CS has informational value. It tells the subject that the UCS is about to 
occur . . . and the subject blinks. This approach, termed the informational theory,
is also known as the expectancy approach. The CS must predict the impending
occurrence of the UCS, or conditioning will be minimal. 
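
One way to make this idea concrete is to compare how often the UCS occurs when the CS is present with how often it occurs when the CS is absent. The Python sketch below is an illustration, not a procedure taken from the studies cited. The function name contingency and the counts in the example are hypothetical, and the session is assumed to be divided into equal intervals that either do or do not contain the CS.

```python
def contingency(ucs_with_cs: int, cs_intervals: int,
                ucs_without_cs: int, no_cs_intervals: int) -> float:
    """Return P(UCS | CS) - P(UCS | no CS) from interval counts.

    A value near zero means the CS tells the subject nothing about the UCS;
    a positive value means the UCS is more likely when the CS is present.
    """
    p_given_cs = ucs_with_cs / cs_intervals
    p_given_no_cs = ucs_without_cs / no_cs_intervals
    return p_given_cs - p_given_no_cs


# Air puff equally likely with or without the bell: no informational value.
print(round(contingency(8, 10, 8, 10), 2))  # 0.0

# Air puff much more likely after the bell: the bell is informative.
print(round(contingency(8, 10, 2, 10), 2))  # 0.6
```

On the informational view, little or no conditioning is expected in the first case and conditioning is expected in the second, even though the bell and the air puff are paired equally often in both.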

BLOCKING AND THE RESCORLA-WAGNER MODEL. The Rescorla-Wagner idea is 
that, if a stimulus doesn’t tell the subject anything new or additional about the 
upcoming occurrence of the UCS, that stimulus will not become an effective CS. 
This hypothesis can be used to explain, among many other things, the blocking 
effect. Remember that in blocking, a stimulus A is first paired with a UCS. Thus, 
through this pairing A already signals the upcoming occurrence of the UCS. Then 
X is paired with A and followed by the same UCS. But X doesn’t convey any 
additional information, so it doesn’t become an effective CS. It is redundant in the 
sense that any information it might convey is already being conveyed by A. 
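
The blocking argument can be followed trial by trial. In the Rescorla-Wagner model, the change in a stimulus's associative strength on each trial is proportional to the difference between the maximum strength the UCS can support and the combined strength of all the stimuli present. The Python sketch below applies that rule under stated assumptions: the learning rate of 0.2, the maximum of 1.0, the trial counts, and the equal salience of A and X are all hypothetical values chosen for illustration.

```python
def rw_update(strengths, present, lam, rate=0.2):
    """Apply one Rescorla-Wagner trial, updating strengths in place.

    strengths: dict mapping stimulus name -> associative strength
    present:   stimuli presented on this trial
    lam:       maximum strength the UCS supports (0.0 if the UCS is omitted)
    rate:      combined learning-rate term, an assumed value
    """
    total = sum(strengths[s] for s in present)
    error = lam - total            # how much the UCS is still unpredicted
    for s in present:
        strengths[s] += rate * error


def run_blocking(pretrain_trials, compound_trials=50):
    """Pair A with the UCS, then AX with the UCS; return final strengths."""
    v = {"A": 0.0, "X": 0.0}
    for _ in range(pretrain_trials):
        rw_update(v, ["A"], lam=1.0)
    for _ in range(compound_trials):
        rw_update(v, ["A", "X"], lam=1.0)
    return v


blocked = run_blocking(pretrain_trials=50)   # A trained first, then AX
control = run_blocking(pretrain_trials=0)    # AX only, no pretraining of A
print(f"X after A was pretrained: {blocked['X']:.3f}")
print(f"X without pretraining:    {control['X']:.3f}")
```

In the blocked group A already predicts the UCS almost perfectly when X is introduced, so the error term is near zero and X gains almost nothing. In the control group A and X share the available strength roughly equally.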

Practical Uses of Classical Conditioning 
Before moving to consider instrumental conditioning it should be made clear that 
classical conditioning has been put to good use outside the laboratory. For 
example, it has been used to control enuresis, or bed-wetting (Azrin, Hontos, & 
Besalel-Azrin, 1979), blood pressure (Whitehead, Lurie, & Blackwell, 1976), and 
irrational fears (Wolpe, 1958). Box 2.1 contains a detailed example of one
practical use of classical conditioning. 

SELECTED EXAMPLES OF INSTRUMENTAL CONDITIONING 
Instrumental Reward Training 

A rat’s learning to press a bar for food in a Skinner box is a straightforward 
example of reward training. The animal is rewarded when it makes a particular 
response. As is true of classical conditioning, reward training is pervasive in our 
life as well. Children learning to tie their shoes for parental approval, adolescents 
learning a paper route for spending money, college students absorbing course 
materials for a grade, athletes mastering a sport for money and recognition, and 
college professors learning to publish experiments for promotions all represent 
reward training. The essential characteristic of the situation is that the animal, or 
human, must first make a particular response before it receives something it wants 
or needs. 

Female Rhesus monkeys will learn to press levers when lever pressing is 
rewarded by access to male monkeys. (Incidentally, when given a choice between 
males, females will lever-press for the males that groom the most and ejaculate the 
most [Michael, Bonsall, & Zumpe, 1978].) Even garter snakes will learn for 
rewards. Kubie and Halpern (1979) have found that they will learn to follow 
trails through mazes that have been scented with earthworm extract when they 
are fed at the end of the maze. The literature is filled with hundreds of studies of 
reward training, some of which we examine below. 

SHAPING. Shaping is an important concept developed in connection with reward 
training. It refers to the reinforcement of successive approximations to a desired 
response. The best way to train a rat to press a bar is first to reinforce
approximations to the full-blown bar press, then to reinforce closer approximations, and
finally to reinforce only an actual bar press. Initially, we reinforce the hungry 
animal each time it turns toward the bar. After this response is firmly established, 
we reinforce the animal only if it moves toward the bar. Finally, we reinforce the 
animal only if it actually presses the bar. In other words, we slowly guide the 
animal toward the response we are interested in. If we were to wait for the full bar 
press to occur “spontaneously” before we reinforced anything, we might never 
bring the response under our control. 
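
The logic of shaping is a criterion that tightens as each approximation becomes established. The Python sketch below illustrates that logic for the bar-press example. The names CRITERIA and shape are hypothetical, and so is the rule that three reinforced responses in a row count as "firmly established"; a real trainer would judge this from the animal's behavior.

```python
# Successive approximations, from crudest to the full response.
CRITERIA = ["turns toward bar", "moves toward bar", "presses bar"]
RANK = {behavior: i for i, behavior in enumerate(CRITERIA)}


def shape(observed, needed_in_a_row=3):
    """Return a reinforce / do-not-reinforce decision for each behavior."""
    stage, streak, decisions = 0, 0, []
    for behavior in observed:
        # Reinforce anything that meets or exceeds the current criterion.
        reinforce = RANK.get(behavior, -1) >= stage
        decisions.append(reinforce)
        streak = streak + 1 if reinforce else 0
        # Once the response is established, demand a closer approximation.
        if streak >= needed_in_a_row and stage < len(CRITERIA) - 1:
            stage, streak = stage + 1, 0
    return decisions


session = ["turns toward bar"] * 4 + ["moves toward bar"]
print(shape(session))  # [True, True, True, False, True]
```

The fourth turn toward the bar goes unreinforced because the criterion has already moved on to approaching the bar, which is the step that guides the animal toward the final response.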

This same sort of shaping technique can be used in many human situations. For 
example, it is often used in attempts to teach autistic children to speak. In 
attempting to establish verbal repertoires psychologists may deprive these
disturbed children of one meal and then reinforce them with food (for example, bits
of sugar-coated cereal) each time they make any sound at all. At first, any sound is 
reinforced. Grunts, snorts, syllables, and squeaks are all reinforced. Then, as 
training progresses and the rate of sound production goes up, the teacher becomes 
more selective in reinforcing. He or she refrains from reinforcing just any sound 
and will begin to reward only those sounds that begin to approximate some 
desired verbal response (for example, a word). Gradually the child’s sounds are 
shaped to conform to the desired pattern. This shaping technique appears to be 
quite successful in teaching previously mute children to produce distinct verbal 

Box 2.1 The Polygraph, or “Lie Detector”

We have all heard of polygraph machines or “lie detectors.” They are supposed to be able to 
detect when a person is lying by recording changes in blood pressure, pulse rate, breathing, 
and the amount of perspiration produced by the skin. But, strictly speaking, they are not lie
detectors at all. They are emotion detectors. By recording changes in the four variables 
mentioned, we can detect strong, classically conditioned emotional reactions. As you know 
from experience, strong emotions such as fear or anxiety can cause an increase in sweating 
and heart rate, as well as a change in breathing. 

For example, suppose as a young child you managed to crawl under your house and 
stumbled into a nest of large, fat, brown spiders. Suppose you suffered a number of bites, to 
say nothing of the scrapes and bumps acquired during your spectacular exit. In later years the 
mere mention of spiders, particularly fat brown ones, might be enough to trigger a strong 
sense of disgust and fear. This reaction could be detected by the polygraph machine, even if 
you tried to conceal it. The situation involves classical conditioning. The UCS is the 
experience with the spiders, the CS is the verbal label “spider,” and the CR is the fear and
revulsion triggered by the mention of spiders. 

Obviously, polygraph machines have been used in police work. The notion is that criminals 
will lie to conceal their guilt, but they cannot control their emotional reactions, and so the lie 
detector will pick up these “signs of guilt.” However, there are serious doubts about this use 
of the polygraph. For example, suppose four big police officers escorted you to a police 
station, attached you to a strange machine with wires, looked at each other significantly, and 
asked, “Did you murder Elizabeth K. Stone?” Even an innocent person might have a strong 
emotional reaction to such a question asked under these circumstances. The polygraph 
cannot distinguish between types of reaction, such as guilt or alarm. It can only measure the 
amount of reaction. Therefore, such direct questions are unsuitable for detecting lies—or 
truth either. 

One way to avoid the possibility of innocent people responding emotionally to such direct, 
general questions is to consider details of the crime, or to focus upon elements of the crime 
that could be known only by the guilty party. For example, suppose a victim has been 
murdered with the leg of a chair. The police might show all suspects a series of pictures of 
common objects, including the actual murder weapon. In this case, the murderer might reveal 
his guilt by reacting strongly to the murder weapon but not to the remaining objects. 
Innocent parties, on the other hand, would be unlikely to react strongly to the leg of a chair. 

Even with these refinements, the use of polygraph records has been extremely controversial, 
and their reliability has been questioned often. Some types of psychopathic individuals appear 
to be able to “beat the lie detector” because they have no strong feelings about their crimes. 
Similarly, drugs of various sorts will affect polygraph records. 

In addition to police work, the polygraph has been used in psychotherapy situations. For 
example, suppose a therapist is attempting to determine what is bothering a new patient. She 
asks the patient if he is in any way disturbed by, or concerned about, his sex life. He responds 
with a knowing look and a little chuckle, “No. No. Everything is just fine there. Just fine.” 

But if, at the same time the patient is speaking with such outward calm, the recording needles 
of the polygraph machine are jumping wildly, the therapist might suspect that the controlled 
verbal report was concealing strong inner reactions to the topic of sex. The word “sex” might 
be a CS for all sorts of emotional reactions, and the polygraph could be useful in detecting 
this area of concern. 

Whatever the final outcome of the controversy surrounding the polygraph, there is no 
doubt that it represents another way in which the principles of classical conditioning have 
been put to work outside the laboratory. 
behaviors (Hewett, 1965; Kerr, Myerson, & Michael, 1965; Lovaas, 1967; 
Lovaas, Berberich, Perloff, & Schaeffer, 1966). Sometimes children who would 
otherwise remain nonverbal can be taught sign language instead of speech 
behavior (Carr & Kologinsky, 1983). And finally the siblings of autistic children can 
be taught how to use reward training. Not only do they help their autistic 
brothers and sisters, but they end up liking them better, too (Schreibman, O’Neill, 
& Koegel, 1983). 
TOKEN ECONOMIES. Reward training, as we just saw, is not limited to laboratory 
exercises. It has often been applied successfully in dealing with practical 
problems. Token economies represent a very good example of the usefulness of 
reward training. In a token economy desirable behaviors are immediately reinforced 
or rewarded with tokens, which may later be exchanged for primary 
reinforcers such as food, beverages, and privileges. For example, the staff of a 
mental hospital may reinforce desirable behaviors (positive speech, cooperation, 
completion of chores, neat appearance) with poker chips, gold stars, or check 
marks, which may later be turned in for any number of more basic rewards 
(clothing, personal items, games, tools, snacks, television time, recreational 
privileges). 
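
The bookkeeping behind a token economy can be sketched in a few lines of code. The following is only an illustration, not a description of any program cited here: the class name, the behaviors, and the token values are all hypothetical, chosen to mirror the hospital example above.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class TokenEconomy:
    """Toy ledger for a token economy; behaviors and prices are hypothetical."""

    # Tokens awarded immediately after each desirable behavior (assumed values).
    earn_rates: Dict[str, int]
    # Token cost of each reward the tokens can be exchanged for (assumed values).
    prices: Dict[str, int]
    balances: Dict[str, int] = field(default_factory=dict)

    def reinforce(self, person: str, behavior: str) -> int:
        """Award tokens for a desirable behavior and return the new balance."""
        earned = self.earn_rates.get(behavior, 0)  # unlisted behaviors earn nothing
        self.balances[person] = self.balances.get(person, 0) + earned
        return self.balances[person]

    def exchange(self, person: str, reward: str) -> bool:
        """Trade tokens for a reward; refuse if the balance is too low."""
        cost = self.prices[reward]
        if self.balances.get(person, 0) < cost:
            return False
        self.balances[person] -= cost
        return True


ward = TokenEconomy(
    earn_rates={"completed chores": 2, "neat appearance": 1},
    prices={"television time": 3, "snack": 1},
)
ward.reinforce("patient A", "completed chores")
ward.reinforce("patient A", "neat appearance")
```

The two steps are kept separate on purpose: the token is delivered at once, when the behavior occurs, while the exchange for a more basic reward happens later.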

Tokens have been used in a number of settings, including hospitals, schools, 
outpatient clinics, and homes. For example, Atthowe and Krasner (1968) successfully 
utilized a token economy on a closed ward of chronic psychiatric patients in 
a Veterans Administration hospital. Tokens were awarded when the patients 
cared for their personal needs, attended their scheduled activities, helped on the 
ward, or showed increased responsibility in any way. Similarly, Schaefer and 
Martin (1966) report success in overcoming “apathetic” behaviors displayed by 
hospitalized schizophrenics through the use of token reinforcement procedures. 
Tokens that could be exchanged for cereal, candy, or small toys have also been 
shown to be effective in training retarded children to use sentences (Lutzker & 
Sherman, 1974). Tokens have been used in large-scale educational systems (see 
Klein, 1979) as well as in hospitals. One need only look in a recent issue of the 
Journal of Applied Behavior Analysis or some similar journal to discover the 
widespread use of the technique. 

OTHER REWARDS. Tokens are by no means the only rewards that have been used 
successfully. For example, Pierce and Risley (1974) have demonstrated that the 
opportunity for recreational activities can serve as an effective reinforcer. They set 
up a situation in which the young members of an urban recreational center could 
obtain additional recreation time if they recruited new members for the center. If 
a youngster brought in one new member, he or she was allowed to enter the center 
an hour earlier than the other members. The technique appeared to be fairly 
successful in a series of recruitment drives. 

Powell, Felce, Jenkins, and Lunt (1979) point out that one of the chronic 
problems found in homes for the elderly is that the guests often spend excessive 
amounts of time just sitting about doing nothing. They engage in very little 
physical activity or social interaction. These authors report that they were able to 
increase healthy social and physical activities by rewarding these activities with 
the opportunity to engage in a little indoor gardening. The chance to garden, in 
itself enjoyable, was an effective reinforcer when it was made contingent on the 
occurrence of other sorts of engagement. 

Knight and McKenzie (1974) have reported that bedtime thumb sucking can be 
eliminated through the manipulation of story reading. Several young girls, all 
dedicated thumb suckers, were reinforced for not thumb sucking by being read a 
bedtime story. The authors report that this method was extremely successful. 
(There are those who would probably argue that the behavior need not have been 
eliminated under any circumstances.) 
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Social rewards are probably used more widely than any other type of reward. 
Factors such as praise, attention, flattery, and support are powerful reinforcers for 
most people. For example, Rekers and Lovaas (1974) report a case in which a 
young boy had taken on many female characteristics. He preferred feminine 
clothes and activities, his voice had distinctive feminine qualities, and he enjoyed 
feminine cosmetic articles. In attempting to change this behavior pattern (some 
might want to argue that it should not have been changed), the mother of the 
child was taught to socially reinforce masculine behaviors. She was taught to 
praise, encourage, and support masculine activities and interests and to ignore 
(extinguish) feminine behaviors. Although social reinforcers were not the only 
ones used in this study, they did form an important part of the situation and did 
appear to have a significant impact on the child’s behavior. The authors report 
that 3 years later the boy displayed most of the characteristics of traditional 
masculine behavior. 

ADDITIONAL BEHAVIORS. Just as a wide variety of reinforcers have been used in 
practical situations, so, too, have many different behaviors been altered through 
the use of reward training. Everything from tics (Finney, Rapoff, Hall, & 
Christophersen, 1983) to dog bites (Tortora, 1983) has been altered. For example, a 
potentially dangerous form of self-starvation has been treated using reward 
training. Anorexia nervosa often affects intelligent young women. They simply 
stop eating and, on occasion, have been known to die from this condition. It has 
often been assumed that the condition is a physiological one, and medical doctors 
sometimes recommend tube feeding to keep the patients from starving. 

Garfinkel, Kline, and Stancer (1973) decided to treat the condition as if it were 
a learned rather than a physiological condition. This was a bold decision, because 
so little research had been done on the condition. They treated five women who 
were hospitalized with the condition. Basically, they rewarded weight gains and 
ignored losses. The reinforcers were weekend passes and opportunities to engage 
in social activities. Within 12 weeks almost all of the patients’ original weight had 
been regained. 

Care should be taken in concluding that positive reinforcement techniques are 
the cure for anorexia nervosa. Other investigators (for example, Van Buskirk, 
1977) point out that the situation can often be quite complex and that neither 
behavior modification nor psychotherapeutic techniques are perfect cures. 

Many other behaviors, including overeating (Wilson, 1979), energy conservation 
(Seaver & Patterson, 1976), and grief (Ramsay, 1979) have been altered 
through reward training. The point here is that psychology is doing some good 
beyond the confines of the laboratory by using the principles of conditioning. 

Avoidance Training 

EXAMPLE. In avoidance training the animal receives unpleasant stimulation 
when it fails to make a particular response. The apparatus depicted in Figure 2.4 
can be used to demonstrate avoidance training. One side of the box is painted 
black. The other side, painted white, has a metal grid floor. Electric current can be 
passed through this grid. A hurdle, or fence, separates the black side from the 
white side. When a rat is placed in the white side, it engages in normal rat 
behavior, nosing about and exploring the box. The apparatus is set up such that 
the current will be turned on in 15 seconds. If by that time the animal has not 
climbed over the barrier, removing itself from the white box, it will be shocked. If 
it does move into the black box before the 15 seconds have elapsed, it will not be 
shocked. When the rat is first placed in the box, it will typically wander about. 
When it receives the first shock, it squeaks, leaps about, and soon scrambles into 
the black box. The experimenter begins another trial, picking up the rat and 
placing it back in the white box. The rat again has 15 seconds to jump the fence or 
be shocked. You can easily imagine what will happen in this situation. Over a 
series of trials the rat learns to jump the fence promptly when placed in the white 
box. No question about it. It is leaving, thank you very much. After a series of 
trials the rat no longer receives any shock at all. It has learned to avoid the shock. 

Figure 2.4 Apparatus in which rat can be conditioned to avoid a shock. 
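
The rule that governs each avoidance trial is simple enough to write out. The sketch below is illustrative only: the 15-second delay comes from the procedure just described, but the function name and the list of crossing times are hypothetical and are not data from any experiment.

```python
SHOCK_DELAY_S = 15.0  # from the text: the current comes on 15 seconds into a trial


def is_shocked(crossing_latency_s: float) -> bool:
    """Return True if the rat is shocked on this trial.

    crossing_latency_s is the time the rat takes to climb the hurdle after
    being placed in the white box (a hypothetical input for illustration).
    """
    # The rat avoids the shock only by leaving before the delay has elapsed.
    return crossing_latency_s >= SHOCK_DELAY_S


# Made-up crossing times that shorten over trials as the rat learns.
latencies = [40.0, 22.0, 16.0, 9.0, 4.0, 2.0]
shocks = [is_shocked(t) for t in latencies]
```

With these made-up times the first three trials end in shock and the last three do not, which is the pattern the text describes: once the rat crosses promptly, it no longer receives any shock at all.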

EXTINGUISHING THE AVOIDANCE RESPONSE: FLOODING. In a reward training 
situation it is easy to imagine what will happen if the reinforcement is removed. If 
a rat is no longer given food for pressing a bar, it will stop pressing quite quickly. 
This is called extinction of the response. Similarly, if a high hurdler practices 10 
hours a day and never makes the team, he or she will probably stop practicing. 
But what about the avoidance situation? What do you think the rat will do if the 
shock is permanently removed from the situation? No more shock. Just a white 
box and a black box. Interestingly enough, it keeps right on jumping even though 
there is no need for it. It never waits long enough to find out that it will not be 
shocked. If left to its own devices, the rat will maintain its avoidance response for 
a very long time. 

There are ways to ensure extinction of avoidance responses. For example, one 
way is to make the barrier so high that the rat cannot jump over it. It is forced to 
stay in the white box and discover it will no longer be shocked. The rat is rather 
unhappy for a while, but sooner or later the response drops out. This technique 
for extinguishing the response is called flooding (Mineka & Gino, 1979). Another 
way to eliminate the avoidance response is to present the aversive stimulus no 
matter what action is taken. 

THE TWO-FACTOR THEORY OF AVOIDANCE CONDITIONING. One particular 
theory of avoidance conditioning has been very influential (Mowrer & Lamoreaux, 
1942, 1946). According to this interpretation, avoidance conditioning 
involves both instrumental conditioning and classical conditioning (see Mineka, 
1979). When a rat is placed in the white box and shocked, a kind of classical 
conditioning occurs. The rat’s reaction to the shock becomes classically conditioned 
to the white box. Mowrer and others have called this conditioned 
reaction fear. One can think of the rat’s reaction to the shock as something else if 
one wishes, but fear is close enough for our purposes. The white box is the CS. 
The shock is the UCS. The fear elicited by the shock is the UCR. The fear elicited 
by the white box is the CR. Thus, the first factor in the two-factor theory is the 
classical conditioning of a fear reaction. The second factor refers to the notion 
that the rat will learn to escape from the fear-producing situation in an instrumental 
fashion. When the rat finds itself in the white box, it experiences 
conditioned fear. It then leaps the barrier. The termination of the fear-producing 
stimuli, and consequently the fear itself, is the reward, or reinforcement, for the 
escape behavior. 

In summary, the two-factor theory holds that a negative emotional reaction 
becomes classically conditioned to a particular set of stimuli. The animal then 
instrumentally learns to escape from these circumstances and is reinforced by the 
reduction in the conditioned emotional reaction. You are referred to Bolles 
(1970), Rescorla and Solomon (1967), and Wahlsten and Cole (1972) for discussions 
of the two-factor theory. 

LEARNED HELPLESSNESS. One interesting development in the field of avoidance 
conditioning has been the discovery of the learned helplessness phenomenon 
(Alloy & Bersh, 1979; Brown & Dixon, 1983; Maier, 1970; Maier, Seligman, & 
Solomon, 1969). If, before normal avoidance training, a dog has experienced 
inescapable shock, it will sometimes fail to learn the avoidance response. For 
example, if a dog is shocked repeatedly in a Pavlovian harness, without any 
opportunity for escape, and is then placed in an avoidance situation where shock 
can be avoided by leaping a barrier, the dog will sometimes remain relatively 
silent and motionless during the shock periods. It will fail to learn the escape 
response. After unavoidable shock the dog somehow operates as though action 
would be fruitless. It is without “hope.” 

Similar reactions have been observed in humans (Miller & Norman, 1979). 
Sometimes prisoners of war adopt an aggressive stance and attempt to subvert the 
purposes of their captors. Others become apathetic, listless, and without the will 
to escape or resist (Strassman, Thaler, & Schein, 1956). The difference between 
the two types of prisoner seems to lie in their attitude about the effectiveness of 
their own actions. The active, resisting prisoners believe that they can improve the 
situation through action, whereas the “helpless” prisoners believe that there is no 
hope. They feel that any action on their part would be futile or perhaps dangerous. 
Similarly, Rowland (1977) has found that particularly unnerving environmental 
events, such as the death of a significant other, relocation, or retirement, 
can lead to a sense of helplessness, and even to death, in elderly people. 

Susceptibility to learned helplessness may be, in part, inherited. Anisman, 
Grimmer, Irwin, Remington, and Sklar (1979) report that they were able to breed 
rats selectively such that some lines were more susceptible to the phenomenon 
than others. 
There are at least four ways of looking at, or theorizing about, learned 
helplessness: 

1. Seligman’s interpretation. The original Seligman approach stressed the idea 
that the animal, during inescapable shock, learns that its own responses are 
independent of shock offset; they will not do any good. This interpretation 
emphasizes the notion of “hopelessness.” Seligman (1975) proposed that learned 
helplessness can serve as a model for human depression. Specifically, he felt that 
depression brought on by unpleasant uncontrollable events in our environment 
can best be thought of in terms of learned helplessness. 

Seligman’s idea was a bold and good one, but it proved to be too simple-minded 
to account for all instances of human depression. Specifically, it failed to 
account for cognitive events, which can influence and determine the nature of 
depression. The nature of our depression depends on how we think about the 
causes, nature, and consequences of that depression. For example, suppose two 
people fail an exam. One of them attributes the failure to personal inadequacy, 
whereas the second person decides that the test was so difficult and unfair that no 
one could have passed it. The first person’s self-esteem may suffer, whereas the 
second person’s won’t. In other words, the same event (failure) can lead to 
different feelings, depending on how we interpret the causes of that event. 

Abramson, Seligman, and Teasdale (1978) presented a sweeping reformulation 
of the model of depression to account for the role of cognition in depression. They 
proposed three important dimensions. The first of these, personal versus universal 
attributions, we have just considered. If you attribute failure to your own shortcomings, 
you will feel worse than if you attribute that failure to some external 
cause. 

Seligman’s second dimension, specific versus global attributions, was introduced 
to account for the fact that depression and helplessness sometimes seem 
quite specific (I only fail at school) and sometimes quite general (I can’t do 
anything right). The earlier formulation, which argued that depression was similar 
to what happens to dogs when they are unavoidably shocked, could not 
account for these subtle but substantial differences in depression. 

Finally, Seligman’s third dimension, stable versus unstable attributions, was 
developed to account for the fact that some depressions seem to last forever, 
whereas others go away. If you flunk an exam and conclude that you did so 
because you are dumb, then you may be depressed for quite a while. But if you 
conclude that you flunked because you momentarily panicked, then you may get 
over your depression soon. 

Seligman’s interpretation is interesting because it was first developed in connection 
with basic animal research but was then expanded to account for variations 
in human depression that are the result of our thinking about who we are and 
what has happened to us. 

2. Competing-response interpretation. While Seligman and his colleagues have 
moved toward a consideration of how cognitive events affect depression, others 
have formulated alternative theories of the basic helplessness phenomenon. For 
example, some have argued that it is not the inescapability of shock that is critical 
to the phenomenon. They propose that the shock results in the development of 
motor responses that are incompatible with later escape. These competing responses, 
learned during shock, later compete with the animal’s efforts to learn the 
escape response. During shock, for example, the animal may find crouching to be 
at least partially rewarding. Then during the subsequent nonshock phase, when 
escape is possible, this learned crouching behavior will compete with, or interfere 
with, the active motor responses that are necessary for escape. According to this 
view, the animal is feeling neither helpless nor hopeless. It is merely continuing to 
do what has been rewarding in the past—that is, crouch (Anderson, Crowell, 
Cunningham, & Lupo, 1979; Anisman, de Catanzaro, & Remington, 1978; 
Bracewell & Black, 1974). 

3. Neurochemical interpretation. Anisman (1978) and others have stressed the 
biochemical events that occur during inescapable shock. Specifically, Anisman 
argues that shock (stress) depletes brain norepinephrine and perhaps causes other 
neurochemical changes as well. These chemical changes result in deficits in motor 
activity, which could account for the lack of escape learning. 

4. Dual-process model. Glazer and Weiss (1976) are sympathetic with the 
neurochemical interpretation. But they point out that the proposed chemical 
changes are only temporary. Such changes could account for learning deficits only 
over a period of a few hours, whereas the learned helplessness phenomenon has 
appeared when escape learning has been separated from inescapable shock by 
several months. For instance, Fenton, Calof, and Katzev (1979) shocked guinea 
pigs within 24 hours of their birth and found that the escape learning deficit still 
appeared when the animals were full grown. To account for these facts Glazer 
and Weiss (1976) have constructed a dual-process model. According to this 
interpretation, chemical changes are presumed to account for deficits in escape 
learning that occur over short periods of time, whereas long-term deficits are 
attributed to the effects of competing motor responses that have been learned 
during inescapable shock. 

OPIOIDS AND LEARNED HELPLESSNESS. Recent work has led to the intriguing 
discovery that the brain itself produces opiate-like pain-suppressing substances 
(Leibeskind, Lewis, Shavit, Terman, & Melnechuk, 1983; Lewis & Leibeskind, 
1983). Apparently the brain has built-in pain-suppressing systems. The guess is 
that, when pain becomes too great, these systems begin producing opiate-like 
substances that temper the pain. 

Opiates (morphine, heroin, and the like) are drugs derived from plants. Opioids, 
in contrast, are these more recently discovered analgesics produced by the brain 
itself. The opioids include, among others, the endorphins, which have received 
considerable attention. 

The emerging picture is that these pain-suppressing systems do not come into 
action too soon; if they responded to the mildest pain, the organism would never 
be warned of any danger. If pain is so great that it becomes maladaptive, however, 
it is to the animal’s advantage to produce opioids. Under conditions of stress one 
would expect opioids to be produced. 

What has all this to do with learned helplessness? Maier, Sherman, Lewis, 
Terman, and Leibeskind (1983) reasoned that, if helplessness is stressful, it should 
lead to opioid production. That is exactly what they found; learned helplessness is 
followed by opioid production. 

This is not a theory of helplessness. Rather, it is one more piece in the overall 
puzzle presented by the helplessness phenomenon. 
Escape Training 

Another variety of instrumental conditioning is called escape training. In this 
situation inevitable unpleasant stimulation is continued unless the subject makes 
a particular response. Imagine the avoidance-learning situation just described, but 
with one slight change. In escape training the current is always on in the white 
box. The rat is dropped into the white box and is, obviously, shocked when it 
lands on the grid. It quickly learns to escape by leaping the barrier. The difference 
between avoidance and escape training is that in escape training the animal can 
never completely avoid the shock. No matter how hard it tries, it is always 
shocked at the beginning of every trial. 

Many laboratory examples of escape training are available. Leeming and Little 
(1977) enclosed houseflies (Musca domestica) in a Plexiglas tube. One-half of the 
tube was lighted, hot, and, presumably, very uncomfortable. The other half of the 
tube was relatively dark and cool. The houseflies quickly learned to escape from 
the uncomfortable side and to spend their time in the shady portion. 

Of course, escape training is not limited to “lower” animals and the laboratory. 
When an enthusiastic father tosses his screaming, 4-year-old nonswimmer into 
the family pool, the child is undergoing escape training. Swim or sink. The child 
hits that unpleasant water every time, just as the rat hits the electrified grid on 
every trial. (It is true that the child’s dislike of the water may lessen with exposure, 
but those first few tosses into the water surely represent escape training.) 

THE RELATIONSHIP BETWEEN ESCAPE AND AVOIDANCE TRAINING. If you think 
back to the discussion of avoidance training, you can see that avoidance training 
really involves classical conditioning of the fear reaction and instrumental escape 
training. In other words, the animal learns to escape the unavoidable fear that has 
been classically conditioned to the white box. It can learn to avoid the shock 
completely, but it cannot avoid the fear completely. It can only escape that 
unpleasant emotion, because fear is elicited every time it is placed in the white 
box, no matter how quickly it jumps the barrier. 

Punishment Training 

In punishment training the animal receives unpleasant stimulation when it makes 
a particular response. Suppose that a cat has been trained to run from a start box, 
through a maze, and into a goal box, where it is rewarded with food. If the 
situation is suddenly changed to one in which the animal is shocked in both the 
maze and the goal box, the animal will stop running and will spend most of its 
time in the safe starting box. If children have had a lot of rowdy fun in the first 
grade but are routinely punished for such disruptive behavior by their new 
second-grade teacher, they will probably drop the offensive action. Punishment 
training can also involve the withholding of anticipated positive stimuli as a 
means of teaching the subject what not to do, such as withholding dessert when a 
child misbehaves at dinner. Solomon (1964) argues that the essence of punishment 
training is that the animal is being taught what not to do, whereas in escape and 
avoidance training the animal is being taught what to do. 

The effects of punishment are complicated and tricky. For example, some 
responses are much more susceptible to punishment than others. Shettleworth 
(1978) found that three different responses of golden hamsters were unequally 
affected by punishment. Face washing, standing and rearing on the hind legs, and 
scrabbling (standing against a wall and moving the forelegs) were differentially 
affected. A second complexity in the punishment situation is the fact that certain 
responses will be affected by certain kinds of punishment but not by others. 
Klunder and O’Boyle (1979) investigated the effects of punishment on the tendency 
of mice to attack and eat crickets. They found that when shock was used as the 
punisher, both attacking and eating could be reduced. But when the punisher was 
an injection of lithium chloride, which makes the mouse ill, the tendency to eat 
crickets was reduced by a much larger degree than was the tendency to attack the 
insects. Finally, Dunham (1978) points out that unpunished responses of various 
sorts can undergo changes during the punishment of some other response. 

Some additional factors that determine the effectiveness of punishment include 
the intensity of the punishment, the strength of the response to be punished, and 
the species of the subject. Many of the important variables influencing punishment 
are discussed by Solomon (1964) and Fantino (1973). 

In any case, the application of punishment may not always work out as 
planned. If, for example, you wish to stop your dog from escaping through a hole 
in the backyard fence, you may wait patiently by the hole until the dog attempts 
to go through. As it passes through the opening, you may swat it with a 
newspaper. That is punishing a particular response, or is it? Maybe not. Your dog 
may well avoid you for two weeks and continue to use the hole in the fence. 

AVERSION THERAPY. Punishment has been used successfully in eliminating many 
human responses. In aversion therapy, which is essentially a punishment technique, 
unpleasant stimuli are associated with unwanted behaviors. Aversion 
therapy is often used to try to break “bad habits.” 

In one interesting study Morosko and Baer (1970) attempted to treat alcoholics 
by using a punishment procedure. Subjects were presented with six cups of liquid. 
Four of them contained nonalcoholic beverages, and the other two contained 
alcoholic beverages. Initially, the subjects were required to drink the contents of 
all six cups. They were shocked as they drank the two alcoholic beverages. On 
subsequent trials the subjects were allowed to omit the alcoholic beverages and 
thereby avoid the concomitant shock. Consumption of alcohol dropped markedly. 
The avoidance effect persisted beyond the laboratory situation. Follow-up 
work indicated that many of the subjects decreased their everyday drinking (see 
also Wilson, Leaf, & Nathan, 1975). 

Goldiamond (1965) reduced stuttering by following it with delayed feedback of 
the subject’s own voice. (Delayed feedback is highly aversive to most people.) In 
addition, Pierce and Risley (1974) effectively reduced certain unwanted behaviors 
in an urban recreational center by following them with punishing stimuli. Each 
time individuals engaged in such acts as leaving trash about, leaving pool cues 
out, crowding in line, arguing, or breaking things, they were punished by restrictions 
on their recreation time. Depending on the severity of the offense, from 1 to 
15 minutes of recreation time were canceled. Many additional instances of 
aversion therapy are available. For example, Singh (1979) reduced breath holding 
in a 15-month-old boy by administering the smell of ammonia (see also Doke, 
Wolery, & Sumberg, 1983). Gray (1979) reports a case in which a child who 
constantly pulled out her own hair (a condition known as trichotillomania) was 
slapped four times on the hand each time she pulled her hair. She stopped. 
Aversion therapy can also be used to reduce obesity, overeating, and smoking. 
Clearly, aversion therapy should be used with great caution. The administration 
of pain must be carefully considered (see Lundin, 1974). 

STOP THOUGHT. An interesting variation on the punishment theme involves 
stop thought (Turner, Holzman, & Jacob, 1983; Tyron, 1979; Tyron & Palladino, 
1979). Stop thought, although controversial, is often taken as an effective way 
of eliminating obsessive thoughts, which we cannot seem to get out of our mind. 
They can be mild (Did I leave the lights on?) or very severe (I want to hurt 
someone). In all cases, they recur and become disruptive in our life. 

In the stop thought technique subjects are trained to shout “Stop!” either aloud 
or to themselves, each time the obsessive thought begins. The sharp command 
“Stop!” is thought of as a mild punishment. In some cases the subject is asked to 
imagine a therapist or other authoritarian figure shouting “Stop!” The technique 
appears to be at least partially successful in reducing obsessive thoughts. 

THE RELATIONSHIP BETWEEN AVOIDANCE AND PUNISHMENT TRAINING. Mowrer 
(1960) has argued that punishment training is not that different from avoidance 
training. He writes that in both cases fear is classically conditioned and then 
instrumentally escaped. As we already know, in avoidance conditioning a rat’s 
fear is classically conditioned to the white box. The rat then learns instrumen¬ 
tally to escape from that situation. In punishment training the fear is classically 
conditioned to the punished response itself. The response itself is then avoided. 
It is as though the thought of making the punished response makes us afraid to 
do it. 

Keep in mind that, although distinctions have been made between reward, 
avoidance, escape, and punishment training, these distinctions are operational 
and descriptive. A little thinking has led us to the position that the four types have 
a good deal in common. 
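
The operational character of these four distinctions can be made concrete with a short sketch. The Python below is only an illustration under stated assumptions: the names Training and consequence are hypothetical, and the phrase "nothing happens" for the unscheduled case is an assumption added here rather than a claim made in the text. It simply tabulates what the experimenter does on a trial, depending on whether the animal makes the target response.

```python
# Illustrative sketch only: names and outcome wording are hypothetical.
from enum import Enum


class Training(Enum):
    REWARD = "reward"
    AVOIDANCE = "avoidance"
    ESCAPE = "escape"
    PUNISHMENT = "punishment"


def consequence(training: Training, responded: bool) -> str:
    """Describe what the experimenter does on one trial."""
    if training is Training.REWARD:
        # Rewarded when the response is made.
        return "reward delivered" if responded else "nothing happens"
    if training is Training.AVOIDANCE:
        # Punished when the response is NOT made.
        return ("aversive stimulus withheld" if responded
                else "aversive stimulus delivered")
    if training is Training.ESCAPE:
        # Ongoing punishment continues unless the response is made.
        return ("ongoing aversive stimulus ends" if responded
                else "aversive stimulus continues")
    if training is Training.PUNISHMENT:
        # Punished when the response is made.
        return "aversive stimulus delivered" if responded else "nothing happens"
    raise ValueError(f"unknown training type: {training!r}")


for t in Training:
    print(f"{t.value:>10}: respond -> {consequence(t, True)}; "
          f"no response -> {consequence(t, False)}")
```

Laid out this way, the four procedures differ only in which stimulus is scheduled and whether it is tied to responding or to not responding, which is the sense in which the distinctions are operational and descriptive.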

Positive Versus Negative Reinforcement 
Positive reinforcement usually refers to anything that, if presented to the organ¬ 
ism following a response, will increase the probability of that response’s occur¬ 
ring. For example, if a hungry rat presses a bar and is then given food, that 
reinforcement is called a positive one. Things such as food, water, sex, and the 
opportunity to explore are usually considered to be positive reinforcers. 

Negative reinforcement usually refers to something aversive that, if taken away 
from the animal, will increase the probability of a response’s occurring. For 
example, if an animal leaps a barrier and shock is removed, then the probability
of the escape response will have been strengthened. Generally speaking, noxious 
stimuli such as shock are considered to be negative reinforcers. They must be 
removed to obtain the reinforcing effect. 

Now, having made this neat distinction between positive and negative reinforc¬ 
ers, can we argue that it might be a superficial one? Consider the rat pressing the 
bar for food. Does this situation involve a positive or a negative reinforcer? We 
are tempted to say positive, because we are presenting something to the animal 
following its response. Is it possible, though, that the reinforcing factor in this
situation is the removal of hunger pangs and not the mere presentation of the
food? Is it possible that this situation involves a negative reinforcer? It is as 
though psychologists have distinguished between positive and negative reinforcers
not on the basis of what is important to the animal but on the basis of what
they do in the experimental situation. Positive and negative reinforcers may be
opposite sides of the same coin. As far as the animal is concerned, the presentation
of food may normally be the same as taking away hunger pangs. The removal
of shock may be the same as presenting the state of not being shocked. 

If we entertain the possibility that positive and negative reinforcers may be 
opposite sides of the same coin, then we must consider the possibility that reward 
training and escape training are identical. In reward training the rat may merely 
be learning to escape from the unpleasant state of being hungry or otherwise 
deprived. In escape training the rat may be reinforced by the presentation of the 
state of nonshock. The two processes may well coincide. When we are hungry and 
seek food, we may well be reinforced by the presentation of food and the removal 
of hunger pangs. It is difficult to separate the presentation of pleasant factors and 
the removal of unpleasant ones. 

Some interesting experiments bear on this issue. They suggest that both the 
presentation of food and the removal of hunger reinforce behavior. A discussion 
of these studies will be delayed until a later section. 

Responding Versus Not Responding 

If an animal is punished for making a particular response and ceases to make that 
response, it is customary to say that we have taught the animal what not to do. A 
response has been eliminated. But is that the case? Is it not possible to think of the 
act of not responding as an actual response in some cases? When we try not to be 
rude to others, because we know it will lead to punishment, are we not actually 
engaging in some kind of response? If we think of the act of not responding, or 
inhibiting a response, as an active response, then punishment training and reward 
training become closely related. Punishment training can be thought of as reward 
training in which the response of not responding is rewarded by the absence of 
punishment. Before we accept the psychologists’ distinction, we should think 
carefully about the meaning of their terms. These definitions are not as clean and 
crisp as we would like them to be. 

Behavior Modification and Cognitive Behavior Modification 
In exploring the various classical and instrumental types of learning we have seen 
many examples of the ways in which practical human problems can sometimes be 
solved. Many of these examples involve behavior modification. Behavior mod¬ 
ification is a very general term referring to any effort to alter human behavior 
through the application of the basic principles of conditioning. Traditionally, 
behavior modification has focused on changing overt, observable, noncognitive 
behavior. It has been thought of as an alternative to the psychotherapies that 
essentially try to change the way people think and act by engaging them in 
conversation. Behavior modification techniques, for the most part, have tried to 
avoid “what goes on in the mind’’ and tried to emphasize the impact of reinforce¬ 
ment techniques on observable behaviors such as temper tantrums, drinking, 
smoking, and so on. Although behavior modification has addressed some cognitive
problems (for example, phobias, fears), the orientation has been to treat the
human as though he or she were a laboratory animal whose physical behavior
could be altered in much the same manner that we alter the behavior of the white 
rat—physically, nonverbally, and noncognitively. 

Beginning in the mid-1970s a new orientation began to take hold within the 
ranks of behavior modifiers. The term cognitive behavior modification began to 
appear more and more often. Proponents of cognitive behavior modification, or 
CBM, argue that, if we can change the way people think, then we can assist them 
in their efforts to adjust. If we can alter the ways in which people describe
themselves, if we can alter their beliefs, and if we can train them in a variety of 
coping skills, such as information seeking, anticipation of problems, and relaxa¬ 
tion, then we may be able to improve their lot. 

A number of criticisms have been leveled against CBM. Aside from the fact that 
it is difficult to define, Ledwidge (1978, 1979) argues that CBM is a step in the 
wrong direction. He asserts that CBM is taking us back to mentalistic concepts 
such as thoughts and cognitions, which are next to impossible to define or 
measure. In a sense, CBM is turning back toward psychotherapeutic methods, 
where words and talk are the elements involved in changing behavior. Thus, CBM 
represents a turn toward just what behavior modification theorists first wanted to 
avoid—having to deal with elusive, unobservable elements such as thoughts, 
ideas, and images. 

Criticism of CBM has not gone unanswered (see Locke, 1979; Mahoney & 
Kazdin, 1979; Meichenbaum, 1979b). Essentially, the CBM position is that a 
thought is as much a behavior as a bodily movement and, as such, should be 
subject to ordinary principles of learning. A thought, or a cognition, is not 
something mysterious and qualitatively different from a physical act. Even though 
it may be more difficult to measure, it is, nonetheless, a behavior. The CBM 
position holds that, although thoughts, beliefs, and cognitions might have been 
too difficult for us to study at one time, the time has now arrived for us to 
recognize their importance in our efforts to alter the human condition. 

The crux of their argument is that we can no longer deny the importance of 
thought in determining behavior. Thoughts and physical actions interact with 
each other and affect each other. To ignore the power of thought in determining 
behavior is shortsighted and an obvious and enormous oversimplification. 

Whatever the final outcome, which one hopes will be a softening and a
blending of these two viewpoints, CBM seems to be firmly established (see
Barrios & Shigetomi, 1979; Hollandsworth, Glazeski, Kirkland, Jones, & Van 
Norman, 1979; Mahoney, 1979; Meichenbaum, 1979a; Moon & Eisler, 1983). 

MIXED METHODS: ASSERTION TRAINING. Are behavior modification efforts 
either cognitive or noncognitive? Quite often the answer is no. Sometimes cogni¬ 
tive and noncognitive elements seem to be intertwined and intermingled within 
the same treatment. 

Assertion training provides a good example of the way in which cognitive and 
noncognitive factors may be involved in a given treatment technique. Many of us 
are not assertive enough. We tend to become frustrated because we feel that we 
are being used or pushed around by others. When an unreasonable request is 
made of us (“Lend me 20, will you?”), we tend to say yes instead of politely but 
firmly saying no. We have not developed the technique of saying no. Assertion
training (see Derry & Stone, 1979) is a technique designed to help us say no. We 
can learn to refuse to do what we should not be expected to do, and we can learn 
to ask for what we deserve. 

Some steps in a sample assertion training program are given in Table 2.2. As 
you read through these steps, notice the blend, or mixture, of cognitive and 
noncognitive steps. Sometimes you are asked to change your ways of thinking, 
whereas at other points you are encouraged to change your behaviors. In other 
words, it may eventually prove to be impossible to separate, in any concrete way, 
cognitive and noncognitive behavior modification techniques. The distinction 
between cognitive and noncognitive behavior modification, which is receiving so 
much attention, may eventually fall by the wayside. 

Table 2.2 Steps in Assertion Training 


1. Examine your interactions. Are there situations that you need to handle more assertively?
Do you at times hold opinions and feelings within you for fear of what would happen if you
expressed them? Do you occasionally blow your cool and lash out angrily at others? Studying 
your interactions is easier if you keep a diary for a week or longer, recording the situations in 
which you acted timidly, those in which you were aggressive, and those that you handled 
assertively. 

2. Select those interactions in which it would be to your benefit to be more assertive. They may 
include situations in which you were overly polite, overly apologetic, timid, and allowed 
others to take advantage of you, at the same time harboring feelings of resentment, anger, 
embarrassment, fear of others, or self-criticism for not having the courage to express yourself. 
Overly aggressive interactions in which you exploded in anger or walked over others also need 
to be dealt with. For each set of nonassertive or aggressive interactions, you can become more 
assertive, as shown in the next steps. 

3. Concentrate on a specific incident in the past. Close your eyes for a few minutes and vividly 
imagine the details, including what you and the other person said and how you felt at the time 
and afterward. 

4. Write down and review your responses. Ask yourself the following questions to determine how 
you presented yourself: 

a. Eye contact—did you look directly at the other person, in a relaxed, steady gaze? Looking 
down or away suggests a lack of self-confidence. Glaring is an aggressive response. 

b. Gestures—were your gestures appropriate, free flowing, relaxed, and used effectively to 
emphasize your messages? Awkward stiffness suggests nervousness; other gestures (such as 
an angry fist) signal an aggressive reaction. 

c. Body posture—did you show the importance of your message by directly facing the other 
person, by leaning toward that person, by holding your head erect, and by sitting or standing 
appropriately close? 

d. Facial expression—did your facial expression show a stern, firm pose consistent with an 
assertive response? 

e. Voice tone and volume—was your response stated in a firm, conversational tone? Shouting 
may suggest anger. Speaking softly suggests shyness, and a cracking voice suggests 
nervousness. Tape recording and listening to one's voice is a way to practice increasing or 
decreasing the volume. 

f. Speech fluency — did your speech flow smoothly, clearly, and slowly? Rapid speech or 
hesitation in speaking suggests nervousness. Tape assertive responses before you try them 
out in problem situations, so you can practice and improve your fluency. 

g. Timing—were your verbal reactions to a problem situation stated at a time closest to the 
incident that would appropriately permit you and the other person time to review the 
incident? Generally, spontaneous expressions are the best, but certain situations should be 
handled at a later time—for example, challenging some of your boss's erroneous statements 
in private rather than in front of a group that he or she is making a presentation to.

h. Message content—for a problem situation, which of your responses were nonassertive or 
aggressive, and which were assertive? Study the content and consider why you responded in 
a nonassertive or aggressive style. 

5. Observe one or more effective models. Watch the verbal and nonverbal approaches that are 
assertively used to handle the types of interactions with which you have been having problems. 
Compare the consequences between their approach and yours. If possible, discuss their
approach and their feelings about using it. 


6. Make a list of various alternative approaches for being more assertive. 

7. Close your eyes and visualize yourself using each of the above alternative approaches. For 
each approach, think through what the full set of interactions would be, along with the
consequences. Select an approach or combination of approaches that you believe will be most 
effective for you to use. Through imagery, practice this approach until you feel comfortable that 
it will work for you. 

8. Role-play the approach with someone else, perhaps a friend or counselor. If certain segments of 
your approach appear clumsy, awkward, timid, or aggressive, practice modifications until you 
become comfortable with the approach. Obtain feedback from the other person about the 
strengths and shortcomings of your approach. Compare your interactions with the guidelines 
for verbal and nonverbal assertive behavior in Step 4. It may be useful for the other person to
role-play one or more assertive strategies, which you would then practice by reversing roles. 

9. Repeat Steps 7 and 8 until you develop an assertive approach that you are comfortable with and 
believe will work best for you. 

10. Use your approach in a real-life situation. The previous steps are designed to prepare you for 
the real event. Expect to be somewhat anxious when you first try to be assertive. If you are still 
too fearful of attempting to be assertive, repeat Steps 5-8. For those few individuals who fail
to develop the needed confidence to try out being assertive, professional counseling is 
advised—expressing yourself and effective interactions with others are essential for personal 
happiness. 

11. Reflect on the effectiveness of your effort. Did you “keep your cool”? Consider the nonverbal 
and verbal guidelines for assertive behavior discussed in Step 4; what components of your 
responses were assertive, aggressive, and nonassertive? What were the consequences of your 
effort? How did you feel after trying out this new set of interactions? 

12. Expect some success with your early efforts but not complete personal satisfaction. Personal 
growth and interacting more effectively with others is a continual learning process. 


Adapted from Zastrow, C. How to become more assertive. In C. Zastrow and D. H. Chang (Eds.), The personal problem solver. 
Englewood Cliffs, N.J.: Prentice-Hall, 1977, pp. 238—240. Reprinted by permission of Prentice-Hall, Inc. 


SELF-DIRECTED INTERVENTION. Traditionally, behavior modification efforts 
have involved external control in the sense that rewards and punishments have 
been manipulated by someone other than the subjects. Thus, the experimenter 
has been in charge of deciding which of the subjects’ actions should be rewarded 
or punished. But this is expensive in terms of time and effort. If it would work, a 
much more practical procedure would be to have the subjects themselves monitor 
their own behavior and decide for themselves when they do and do not deserve a 
reward. This process of self-direction does away with the impractical necessity of 
having someone else constantly watch the subjects. 

Recent years have seen many efforts to establish such self-directed interven¬ 
tions. For example, Black and Friesen (1983) had obese subjects make a monetary 
deposit. They could “win back” this deposit only if they lost weight. Traditional¬ 
ly, the subjects would have had to come in and be weighed by someone else to see 
if they had lost weight. But in their minimal intervention program Black and 
Friesen let the subjects make all the decisions about how much weight had been 
lost and how much of the deposit they deserved to have returned. The self- 
directed program resulted in significant weight loss (see also Burgio, Whitman, & 
Reid, 1983). 

O’Brien, Riner, and Budd (1983) found that with a little initial help children
were able to reward themselves for doing what their mother wanted them to do in 
the home. Mother didn’t control the rewards; the children did. The children 
monitored their own behavior and gave themselves a token each time they did
something that was desired. Tokens could later be exchanged for more tangible
rewards. 
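
The bookkeeping behind such a self-administered token procedure can be sketched in a few lines. The Python below is a minimal illustration, not a description of the procedure used in the study: the class name TokenLedger, its method names, and the idea that a reward has a fixed token cost are all hypothetical assumptions introduced here.

```python
# Illustrative sketch only: class, methods, and token costs are hypothetical.
class TokenLedger:
    """Tracks tokens that a person awards to himself or herself."""

    def __init__(self) -> None:
        self.tokens = 0

    def record_desired_behavior(self) -> None:
        # The person, not an outside observer, decides a token was earned.
        self.tokens += 1

    def exchange(self, cost: int) -> bool:
        """Trade tokens for a tangible reward; return True if affordable."""
        if cost <= 0:
            raise ValueError("cost must be a positive number of tokens")
        if cost > self.tokens:
            return False
        self.tokens -= cost
        return True


ledger = TokenLedger()
for _ in range(3):
    ledger.record_desired_behavior()
print(ledger.exchange(2), ledger.tokens)
```

The point of the sketch is that nothing in the ledger requires a second party: monitoring, awarding, and exchanging are all under the control of the person whose behavior is being changed.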

Although self-directed experiments look hopeful and allow for the tailoring of 
individual programs (Straw & Terre, 1983), they may not be appropriate for all
problems. For example, Holden, O’Brien, Barlow, and Stetson (1983) report that 
self-directed programs are relatively ineffective in helping agoraphobics (those 
who fear open spaces). 


SUMMARY 

Classical Conditioning 

1. Classical conditioning is a widespread phenomenon. Examples include eye 
blink conditioning, aversive conditioning using sickness as the UCR, seman¬ 
tic conditioning, masochism, and phobias. 

2. The temporal spacing of the CS and the UCS is an important variable in the 
study of classical conditioning. Typical CS—UCS patterns include delayed, 
simultaneous, trace, and backward procedures. 

3. A CR may be observed by leaving out the UCS occasionally or, in the case of 
delayed and trace procedures, by observing the development of the CR 
before the onset of the UCS. 

4. The CR is not the same as the UCR. The CR has been viewed as a component 
of the UCR and as a response that prepares the organism for the UCS. 

5. Responses within the typical conditioning situation are not limited to CRs 
and UCRs. For example, random blinks, voluntary responses, and UCRs to 
the CS (alpha responses) have been identified in eye blink conditioning. 

6. Pseudoconditioning refers to the fact that a CS, when presented alone follow¬ 
ing a long series of solitary UCS presentations, will often elicit a response 
resembling a CR. Pseudoconditioning has been interpreted in terms of gener¬ 
alized excitement and as a case of true classical conditioning. 

7. The length of the CS—UCS interval is a critical variable in classical condi¬ 
tioning. In many cases an interval of about .5 of a second seems to be 
optimal, but this optimal interval interacts with other variables, such as CS 
complexity. 

8. CS and UCS intensity is positively related to strength of conditioning. 

9. A recent trend in the field, multiple-response measurement, has focused on 
the fact that classical conditioning involves many different, and perhaps 
related, changes within the organism. 

10. Latent inhibition (the CS preexposure effect) refers to the fact that repeated 
presentations of the CS alone, before paired presentation of the CS and UCS, 
will sometimes lead to poor conditioning. 

11. The UCS preexposure effect refers to the finding that repeated presentations 
of the UCS, before paired presentation of the CS and UCS, will also lead to 
poor conditioning. 

12. A compound CS is a CS composed of two or more discrete components. 

13. Compound stimulus effects include sensory preconditioning, overshadow¬ 
ing, compounding, blocking, conditioned inhibition, and serial compounding. 

14. Recent informational theories of classical conditioning argue that condition¬ 
ing occurs only when the CS predicts, or carries information about, the 
upcoming UCS.

15. The polygraph is based on classical conditioning procedures.

Instrumental Conditioning 

1. Instrumental reward training refers to a situation in which the animal is 
rewarded when it makes a particular response. 

2. Shaping refers to the reinforcement of closer and closer approximations to a 
desired response. 

3. Tokens, money, social rewards, and many other reinforcers have been used to
reward many human behaviors.

4. In avoidance training the animal is punished when it fails to make a particu¬ 
lar response. 

5. Relative to reward-trained responses, avoidance-trained responses are ex¬ 
tremely durable under conditions of extinction, but flooding can eliminate 
them. 

6. The two-factor theory of avoidance conditioning argues that “fear” is classi¬ 
cally conditioned to the punishment situation and that the animal then 
learns, in an instrumental fashion, to escape from this fear. The reduction in 
fear is the reinforcer. 

7. Learned helplessness refers to the fact that avoidance learning following 
inescapable shock will sometimes be very poor. 

8. Four theories of learned helplessness include the Seligman formulation, the 
competing-response interpretation, the neurochemical interpretation, and 
the dual-process model. Opioids may be produced by the brain during learned
helplessness. 

9. In escape training, inevitable punishment is continued unless the animal 
makes a particular response. 

10. In punishment training the animal is punished when it makes a particular 
response. Aversion therapy involves punishment. 

11. Mowrer argues that punishment and avoidance training are essentially the 
same thing. In avoidance training “fear” is classically conditioned to the 
external situation, whereas in punishment training “fear” is somehow classi¬ 
cally conditioned to the punished response itself. 

12. Positive reinforcement usually refers to anything that, if presented to the 
animal, will increase the probability of a response’s occurring. Negative 
reinforcement usually refers to anything that, if taken away from the animal, 
will increase the probability of a response’s occurring. 

13. The distinction between positive and negative reinforcers may be a superficial 
one. The presentation of something positive may not be different from the 
removal of something negative. 

14. It is possible that the act of not responding is, in some situations, an actual 
response. If this position is taken, then many of the distinctions among the 
different types of conditioning become blurred. 

15. Traditional behavior modification is currently being influenced by cognitive 
behavior modification. 

16. Assertion training involves both forms of behavior modification. 

17. Self-directed minimal intervention procedures are being used more and more 
often.


INTRODUCTION

In this chapter we shall examine some of the important similarities and differ¬ 
ences among the various learning and memory tasks commonly utilized in re¬ 
search. These tasks (for example, classical and instrumental conditioning, verbal 
learning tasks, and cognitive tasks) appear to be quite distinct on a procedural 
level. Are we sure, however, that these different tasks reflect different underlying 
learning processes, or is it possible that they are encompassed within some 
common learning process? Do we have the ability to learn in more than one way? 

Several representative situations will be examined. (Other examples could have 
been chosen, but these will suffice.) They include instrumental conditioning, 
classical conditioning, paired-associate (PA) learning, language learning, and 
priming in lexical decisions. First, PA learning will be examined in relation to 
other types: Is it more like instrumental conditioning, or is it more closely related 
to classical conditioning? Does it clearly represent a totally separate kind of 
learning? Next, we shall briefly consider the possibility that language learning and 
lexical-decision situations are closely related to the other situations under consid¬ 
eration. Finally, differences between classical and instrumental conditioning will 
be examined. Are the traditional differences between these two types of learning 
substantial, or might they be superficial ones that do little to uphold the position 
that classical and instrumental conditioning are fundamentally different? 

This exercise is limited in that it considers only selected learning tasks. If it can 
be shown that these types are closely related, there is no guarantee that all other 
types of learning (for example, problem solving and sequential learning) will also 
be closely tied together. But the weight of the evidence would suggest that such a 
possibility should be taken seriously. 

The implications of these questions are far reaching. If, on the one hand, it is 
found that humans possess the capacity to learn in several different ways, perhaps 
utilizing different levels or aspects of the nervous system, then theories, laws, 
rules, models, and generalizations governing each of these learning capacities will 
have to be developed. If, on the other hand, it is eventually determined that some 
underlying process can account for all the disparate types of learning situation, 
then some single, fundamental explanatory system may suffice. 

We should be aware of a danger here. A lack of precision in our thinking about 
learning tasks may lead us to “find” similarities among tasks when they do not 
really exist. Thus, we must be careful not to overemphasize similarities when 
those similarities may be superficial. We seek neither to accept unwarranted 
similarities nor to dismiss legitimate ones. Rather, we wish to grasp all the 
possible interpretations of some fairly complicated situations. 

PAIRED-ASSOCIATE LEARNING 

Paired-associate learning, essentially a human task, requires the subject to associ¬ 
ate pairs of verbal units. Does this learning task definitely represent some unique 
form of learning, or is it possible that it may be included in some other category, 
such as classical or instrumental conditioning? 

It can be argued that PA learning is an instance of instrumental reward training. 
Suppose that a subject is required to associate the nonsense syllable BEV with the 
nonsense syllable JAX. When the computer exposes JAX, the subject’s task is to 
recall and say aloud “BEV.” The reward, or reinforcement, for making the correct
response comes in the form of response confirmation. The computer, by exposing
BEV, informs the subject that he or she was correct in saying “BEV.” Making
correct responses is definitely rewarding for the average subject in this situation.
Thus, the PA situation contains many of the elements of the typical Skinner box
situation. The human sees the stimulus word, makes a response, and is rewarded.
The rat sees the bar, presses it, and is rewarded.

Figure 3.1 Schematic representation of paired-associate learning as an instance of
classical conditioning.

This does not mean we have proved that paired associates are learned in 
exactly the same way that the rat learns to press a bar. Our mental activities 
(organizing, testing hypotheses) are quite a bit more complicated than the rat’s 
mental operations. But it does suggest that the essential observable qualities of the 
two situations (stimulus, response, and reinforcement) are duplicated. In other 
words, it would seem premature to conclude that instrumental conditioning and 
PA learning represent clearly different types of learning. 

The situation is complicated by the fact that PA learning can be thought of as 
an instance of classical rather than instrumental learning (Figure 3.1). Consider 
the JAX—BEV pair. Assume, for the sake of argument, that JAX is a CS and that 
the printed syllable BEV is a UCS. In the PA learning situation JAX (the CS) is 
presented just before BEV (the UCS). This corresponds to a delayed classical 
conditioning procedure. When the printed syllable BEV is presented, the subject 
reacts to it. That is, the subject thinks, or perhaps even says, “BEV.” The saying or 
thinking of “BEV” may be considered the UCR. It happens every time the printed 
syllable BEV is presented. At first JAX (the CS) does not elicit the thought or the 
verbalization “BEV.” As JAX (CS) and BEV (UCS) are paired over and over, JAX 
(CS) begins to elicit the thought or verbalization “BEV.” When JAX (CS) elicits 
“BEV,” we call that “BEV” a CR. Note that, as conditioning progresses, “BEV” 
(CR) begins to occur in anticipation of the UCS (BEV). It corresponds to the 
pattern obtained in classical conditioning, where the CR follows the CS and 
precedes the UCS. 
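
The ordering of events in this interpretation can be laid out as a short sketch. The Python below is an illustration only: the function name pa_trial and the flag conditioned are hypothetical, and the code says nothing about how or when conditioning actually takes hold. It merely lists the events of one trial, before and after the association has formed, using the roles assigned above.

```python
# Illustrative sketch only: function and parameter names are hypothetical.
from typing import List, Tuple


def pa_trial(cue: str, target: str, conditioned: bool) -> List[Tuple[str, str]]:
    """List the events of one paired-associate trial, in order of occurrence."""
    events = [("CS", f"{cue} is shown")]
    if conditioned:
        # After repeated pairings the cue alone elicits the response,
        # which therefore occurs before the target syllable appears.
        events.append(("CR", f'subject thinks or says "{target}" in anticipation'))
    events.append(("UCS", f"{target} is shown"))
    events.append(("UCR", f'subject thinks or says "{target}"'))
    return events


for label, state in (("early trial", False), ("later trial", True)):
    print(label)
    for role, description in pa_trial("JAX", "BEV", state):
        print(f"  {role}: {description}")
```

On the later trial the CR falls between the CS and the UCS, which is the same ordering described for classical conditioning.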

Interpretations of PA learning have not been limited to classical and in¬ 
strumental learning. Many investigators have begun to think of human learning 
and retention in terms of information processing, with an emphasis on storage 
and retrieval mechanisms (Bourne, Dominowski, & Loftus, 1979; Lindsay & 
Norman, 1972). The information-processing approach will be considered in detail
in later sections.

In summary, PA learning cannot clearly be distinguished from classical and 
instrumental conditioning. It may or may not represent a distinct learning process.
Probably the most scholarly stance to assume is one based on an awareness
of the many alternative ways of thinking about PA learning, and about human
learning in general. 


LANGUAGE LEARNING 

In a later chapter we will explore natural language learning. Without doubt, 
language is a powerful tool, and our ability to learn and use this tool sets us apart 
from most of the rest of the animal kingdom. As we shall see, language acquisi¬ 
tion involves complex processes that are not as yet fully understood. But it is 
instructive at this juncture to point out that our now familiar friends, classical and 
instrumental conditioning, as well as paired-associate learning, have much in
common with at least some aspects of language learning. 

For example, PA learning, at least on some level, resembles what we do when 
we attach verbal meaning to words. When we learn that ocho in Spanish means 
eight in English, we are forming a paired associate. When we learn definitions, we 
are engaging in a rough form of PA learning. The whole process of associating 
verbal meanings with verbal units resembles PA learning. Of course, there are 
alternative interpretations of language acquisition that we shall explore, but at 
the same time it is difficult to deny that at least some aspects of language learning 
resemble PA learning. 

Similarly, some aspects of language acquisition can be thought of in terms of 
classical conditioning. For example, the word dad may acquire some of its 
emotional meaning through classical conditioning. The real father is the UCS. 
The spoken word dad is the CS. The real father (UCS) elicits some complex 
emotional response (UCR) in the child. This emotional response becomes con¬ 
ditioned to the word dad (CS), such that when the word dad occurs in the absence 
of the actual father, a CR (resembling the UCR) will occur. 

Finally, language acquisition can be thought of as resembling instrumental 
conditioning. It is like reward training; each time a child says the word mama, he
or she receives love, attention, and praise. Using words is instrumental in getting 
what we want. 

But do not be misled by these parallels between language learning and the other 
forms of learning. As we shall see in Chapter 13, language learning is complex 
and may involve many different processes. 

PRIMING 

The priming procedure in experiments involving lexical decisions is, relative to 
tasks such as classical and instrumental conditioning, a newcomer in the field of 
learning and memory. As you will recall from Chapter 1, a lexical-decision task 
requires subjects to make some decision about a briefly exposed item, such as 
deciding if the item is a true word or if it has an H in it. The priming effect refers 
to the fact that these decisions are made more quickly and accurately if, before the 
subjects see the briefly exposed target item, they see a word that is related to that 
target. For example, people more quickly recognize that canary is a true word if 
they are first shown bird rather than first being shown rock. The argument is that 
bird somehow activates canary and makes it more easily processed. This priming 
technique is thought to be useful in studying how words are organized in our 
vocabulary. 
This task seems pretty different from classical and instrumental conditioning,
doesn’t it? But if we think about it for a moment, some of these differences 
disappear. Specifically, priming is really only testing learning that occurred before 
the experiment began. It is testing a person’s vocabulary, and that vocabulary was 
certainly well established before the subject walked in and began the priming 
experiment. And there is every reason to assume, as we learned in the immediately 
preceding section, that classical and instrumental learning may be involved in 
language learning, including vocabulary learning. Thus, priming may be nothing 
more than a way to test the strength of learning that was accomplished through 
classical or instrumental events long before the priming experiment began. 

COMPARING CLASSICAL AND INSTRUMENTAL CONDITIONING 
Now that it has been suggested that any number of different human verbal tasks 
can be thought of as being closely related to classical and instrumental condition¬ 
ing, the relationship between these two learning tasks can be addressed directly. 
Are they related forms of some underlying process? The answer to this question 
has been shifting slowly. Twenty years ago it was assumed that instrumental and 
classical conditioning represented two distinct learning systems. More recent 
research has suggested that this traditional distinction may have to be abandoned 
or, at least, modified. 

Our discussion will proceed in three stages. First, we will look at some of the 
problems inherent in trying to compare the two types of conditioning. Second, we 
will examine in detail some of the attempts to condition the same response both 
classically and instrumentally. Finally, we will explore the ways in which the 
traditional distinction, so firmly held for so many years, is beginning to be 
questioned. 


Problems Inherent in the Comparison 
It may well be that we never have a case of pure classical conditioning or a case of 
pure instrumental conditioning. Instrumental conditioning may always involve 
classical conditioning, and classical conditioning may always possess attributes of 
the instrumental paradigm. 

INSTRUMENTAL CONDITIONING IN CLASSICAL CONDITIONING. In classical eye 
blink conditioning, the CR may reduce the impact of the noxious UCS. By 
blinking, the subject partially avoids the air puff. To the extent that the CR does 
avoid the air puff, it becomes instrumental in the animal’s life. It affects its future 
well-being. Hence, this classical situation possesses attributes of the instrumental 
paradigm. Other types of classical conditioning also involve reinforced
instrumental learning. In the salivary conditioning situation meat powder (UCS)
eaten after salivation may promote digestion or may be more pleasant than dry
meat powder eaten without preliminary salivation. Hence, the conditioned
salivary response may be reinforced. It is difficult to be certain about these things, but
the idea that classical conditioning may involve instrumental conditioning has
been discussed by a number of authors (Coleman, 1975; D’Amato, Fazzaro, &
Etkins, 1968; Dickinson & Mackintosh, 1978; Hilgard & Marquis, 1940;
Rescorla & Wagner, 1972). The issue lives on. For example, Miller, Greco, Vigorite,
and Marlin (1983) recently reported evidence for the idea that a CS warns the rat
of upcoming tail shock. The rat then prepares for that unpleasant shock
experience, either by bracing against it or perhaps even by releasing opiate-like
endorphins in the brain. On the other side of the issue we have Domjan and Burkhard
(1982) arguing just as strongly that classical conditioning can occur in the absence 
of this kind of reinforcing event. 

CLASSICAL CONDITIONING IN INSTRUMENTAL CONDITIONING. Just as classical 
conditioning never seems to be quite pure, so instrumental conditioning never 
seems to be completely free of classical conditioning elements. The case of 
instrumental avoidance learning has already been examined; there may be classi¬ 
cal conditioning of “fear” followed by instrumental escape from that “fear.” Even 
more interesting is the fact that classical conditioning has been implicated in 
simple reward training (Trapold & Overmier, 1972). If a rat is trained to press a
bar in a Skinner box and is reinforced for doing so, then it experiences the pairing 
of the Skinner box (CS) with the food (UCS). The positive reaction to the food, 
whatever you want to call it, becomes conditioned to the situation such that, 
when the rat is placed in the Skinner box in the future, it will experience a CR. 
This CR has been called, among other things, “hope,” conditioned excitement, 
and incentive motivation. A rough counterpart of this kind of CR would be what 
you might experience when you are very, very hungry and approach your favorite 
restaurant. You can almost taste the food. The closer you get, the more excited 
you become. The conditioned excitement seems to pull you toward your goal. It 
somehow interacts with your instrumental behaviors. 

The upshot of all this is the current popular emphasis on the idea that classical 
conditioning is always present in instrumental situations. Furthermore, classical 
conditioning is assumed to provide some of the motivation for the instrumental 
behavior. Whether it is classically conditioned “fear” or “hope,” these condi¬ 
tioned responses are such that they provide some of the push behind the in¬ 
strumental response (Lovibond, 1983). 

In summary, even if classical and instrumental conditioning are different, they 
are linked so inextricably that for all practical purposes, attempts to separate 
them may finally prove to be futile. 

Attempts to Condition the Same Response Classically 
and Instrumentally 

Early experimenters assumed that the way to separate classical and instrumental 
conditioning would be to isolate one response and then condition that response 
both classically and instrumentally. If one or the other conditioning procedure 
yielded better conditioning of that given response, then it could be argued that the 
two procedures do differ and that one is superior to the other. Although the 
approach makes sense, the results have been equivocal. Sometimes instrumental 
conditioning leads to faster conditioning (Brogden, Lipman, & Culler, 1938), and 
sometimes classical conditioning is superior (Logan, 1951). 

In the former study guinea pigs were shocked (the UCS) in a revolving exercise 
wheel. The shock produced running (the UCR). A buzzer (CS) was paired with 
the shock. In the classical, or nonavoidance, condition the animal was shocked 
whether it ran or not. In the instrumental, or avoidance, condition the animal was 
not shocked if it ran but was shocked if it failed to run. The group that could
avoid the shock (instrumental conditioning) did a lot more running over a series 
of trials than did the group that received the shock no matter what it did. This 
study seems to suggest that instrumental conditioning is superior to classical 
conditioning. 
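
The procedural difference between the two conditions can be stated as a rule for when shock follows the buzzer on a given trial. The sketch below is only an illustration of that rule as described in the paragraph above; the function name and the condition labels are hypothetical and are not taken from Brogden, Lipman, and Culler (1938).

```python
def shock_delivered(condition: str, ran: bool) -> bool:
    """Return True if shock (UCS) follows the buzzer (CS) on this trial.

    condition: "classical" (nonavoidance) or "instrumental" (avoidance).
    ran: whether the animal ran in response to the buzzer.
    These labels are illustrative assumptions, not the authors' terms.
    """
    if condition == "classical":
        # Nonavoidance: the animal is shocked whether it runs or not.
        return True
    if condition == "instrumental":
        # Avoidance: running cancels the shock; failing to run does not.
        return not ran
    raise ValueError(f"unknown condition: {condition!r}")


# Print the outcome for every combination of condition and response.
for condition in ("classical", "instrumental"):
    for ran in (False, True):
        print(condition, "ran" if ran else "did not run",
              "-> shock" if shock_delivered(condition, ran) else "-> no shock")
```

Only in the instrumental condition does the outcome depend on what the animal does, which is the sense in which running is "instrumental" there.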

In contrast, in the Logan (1951) study an eye blink (CR) either avoided an air 
puff (instrumental conditioning) or did not avoid an air puff (classical condition¬ 
ing). The nonavoidance, or classical, group yielded faster conditioning than did 
the avoidance, or instrumental, group. Thus, one study suggests that classical 
conditioning is superior, whereas the other suggests that instrumental condition¬ 
ing is superior. Obviously, there are numerous problems with this type of com¬ 
parison experiment, and they have occupied psychologists for many years. Hellige 
and Grant (1974a, 1974b) provide a treatment of this approach. 

Traditional Differences Between Instrumental 
and Classical Conditioning 

If you had taken this course a number of years ago, you might well have read that 
instrumental and classical conditioning were clearly distinct and that there were 
several obvious differences between the two types of conditioning. Things were 
relatively simple then. But matters have become a bit more complicated, and 
psychologists have begun to question the validity of these traditional differences. 

DISTINCTION 1: CONTIGUITY AND REINFORCEMENT. Many psychologists believe 
that instrumental conditioning involves reinforcement, whereas classical con¬ 
ditioning does not. According to this position, all instrumental responses, such as 
bar pressing, are laid down, or acquired, through the application of reinforce¬ 
ment. Unless the animal is reinforced, the response will not be acquired. In 
contrast, classically conditioned responses are presumed to be acquired without 
the aid of reinforcement. If a response (such as an eye blink) occurs in the 
presence of a stimulus (such as a buzzer), that response will be connected to that 
stimulus automatically and without reinforcement. According to this position, 
temporal contiguity of the stimulus and response is supposed to be necessary and 
sufficient for classical conditioning to occur. (Two things are temporally con¬ 
tiguous if they occur close together in time.) Temporal contiguity is assumed to be 
necessary for instrumental conditioning but not sufficient. For instrumental con¬ 
ditioning to occur both contiguity and reinforcement are required. 

We already know where this distinction begins to break down. Classical
conditioning may well involve subtle types of reinforcement. A conditioned eye
blink, for example, may be reinforced in that it partially avoids the noxious air
puff. This is not to say that it has been clearly established that classical
conditioning always involves reinforcement. But it is a possibility, and to the extent we feel
that this interpretation is valid, we shall want to reject the notion that classical
conditioning does not involve reinforcement. On this basis classical and
instrumental conditioning cannot be distinguished with any certainty, for
reinforcing qualities can be identified in both situations. Kimmel (1965) has considered
this argument in detail.

Furthermore, studies (see Rescorla & Wagner, 1972) have indicated that
temporal contiguity may be necessary but not, after all, sufficient for classical
conditioning to occur. You will recall from Chapter 2 that an experiment can be
conducted to demonstrate that, if a UCS is equally likely to occur or not to occur 
with a CS, then that CS will not begin to elicit a CR even though it occurs in 
temporal proximity to the UCS and the UCR. In other words, the CS and the UCS 
do occur together in a contiguous relationship, at least part of the time, but no 
conditioning occurs. Thus, reinforcing events can be identified in classical situa¬ 
tions, and, further, contiguity is not sufficient for classical conditioning to occur. 
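
One way to make the phrase "equally likely to occur or not to occur with a CS" concrete is to compare the probability of the UCS when the CS is present with its probability when the CS is absent. The sketch below computes that difference, often written as delta-P. This index is an assumption introduced here for illustration, not a measure named in the text, and the trial counts in the example are invented.

```python
def delta_p(cs_ucs: int, cs_only: int, ucs_only: int, neither: int) -> float:
    """Contingency between CS and UCS: P(UCS | CS) - P(UCS | no CS).

    Arguments are counts of observation intervals (hypothetical data):
      cs_ucs   - CS present, UCS present (contiguous pairings)
      cs_only  - CS present, UCS absent
      ucs_only - CS absent,  UCS present
      neither  - CS absent,  UCS absent
    """
    cs_total = cs_ucs + cs_only
    no_cs_total = ucs_only + neither
    if cs_total == 0 or no_cs_total == 0:
        raise ValueError("need at least one interval with and without the CS")
    return cs_ucs / cs_total - ucs_only / no_cs_total


# UCS is just as likely without the CS as with it: 20 contiguous pairings
# occur, yet the CS carries no information about the UCS.
print(delta_p(cs_ucs=20, cs_only=20, ucs_only=20, neither=20))  # 0.0

# UCS occurs only when the CS is present: a perfect positive contingency.
print(delta_p(cs_ucs=40, cs_only=0, ucs_only=0, neither=40))    # 1.0
```

In the first case the CS and UCS are paired on 20 occasions, so contiguity is present, but the index is zero. That is the situation in which, according to the paragraph above, no conditioning occurs.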

DISTINCTION 2: THE EFFECT OF RESPONSES ON THE ANIMAL’S FUTURE. What
happens to an animal in an instrumental conditioning situation is strongly 
affected by whether it responds. If it responds, it receives a reinforcement. If it 
fails to respond, it gets nothing. Events in the classical conditioning situation, by 
contrast, are often presumed to be independent of the animal’s actions. If the 
animal fails to respond to the CS, it receives the UCS. If it responds to the CS, it 
still receives the UCS. The CS and UCS are delivered in the same manner each 
time, regardless of the animal’s actions. Hence, many psychologists have con¬ 
cluded that in classical conditioning the animal’s future is unaffected by its 
actions, whereas in instrumental conditioning the animal’s actions strongly affect 
its future. 

This argument is specious. As we have seen, the animal’s future is indeed 
affected by its conditioned response in a classical situation. Receiving an air puff 
against a partially closed eye is probably not the same as receiving one against a 
fully opened eye. 

This so-called distinction is closely tied to the previous one. To the extent that 
an animal’s conditioned response is followed by a reinforcer (air-puff avoidance), 
then that conditioned response is clearly affecting the animal’s future (Estes, 
1969; Wahlsten & Cole, 1972). 

DISTINCTION 3: VOLUNTARY VERSUS INVOLUNTARY RESPONDING. Of all the dis¬ 
tinctions between instrumental and classical conditioning, the one most common¬ 
ly cited involves the notion that classical conditioning is a simpler type of learning 
and that only involuntary responses can be modified by it. Because instrumental 
conditioning is presumed to be a more complicated type of learning, it is argued 
that only voluntary responses can be modified by this technique. 

The distinction seems to make sense, in some intuitive way, when one surveys 
the responses discussed in previous sections. Involuntary responses, such as eye 
blinks, heart rate, and salivation, have been modified by classical procedures. 
Complicated voluntary responses, such as bar pressing and leaping over barriers, 
have been modified by the instrumental procedures. The assumption behind this 
distinction is that the two types of conditioning involve different levels, or 
aspects, of the total nervous system, with the classical procedures involving 
involuntary response systems and the instrumental procedures modifying more 
advanced voluntary responses. As we shall see, there is growing doubt about the 
validity of this distinction. 

NEAL MILLER’S WORK. Against all traditional thought, Neal Miller has asked an
intriguing question: Can involuntary responses be conditioned instrumentally? In
other words, can the strength of an involuntary response be increased if it is
followed by a reinforcement? Over a number of years Miller (1973, 1978) and his
colleagues have conducted a series of experiments that seem to suggest that
involuntary responses can actually be conditioned instrumentally.

Figure 3.2 Curarized rat fitted with respirator face mask. The rat is reinforced by
electrical stimulation of the brain. Heart rate is automatically recorded. (Adapted from
DiCara, L. V. Learning in the autonomic nervous system. Scientific American, 1970, 222,
30-39. Copyright © 1970 by Scientific American, Inc. All rights reserved.)

Miller concentrated on visceral responses (for example, heart rate and intesti¬ 
nal contraction), which are mediated by the autonomic nervous system. The basic 
design of his experiments is as follows: The skeletal muscles of a rat are paralyzed 
with curare. (As these muscles are voluntarily controlled by the animal, paralysis 
removes them as a source of possible contamination in the experiment.) Paralysis 
with curare does not affect the involuntary responses under consideration. Next, 
the visceral response (say, heart rate) is recorded accurately. A rat’s heart rate is 
not constant. Sometimes it increases a little; sometimes it decreases. Each time the 
experimenters observe one of those little, spontaneous increases, they reward the 
animal. One problem Miller had to face was how to reward a paralyzed rat. He
solved the problem by using escape from mild shock to the tail as one
reinforcement. The animal had to increase its heart rate to escape shock. Miller found that
rats will increase their heart rate by 5% to 20% in order to escape shock. He
has also used brain stimulation as a reinforcer (see Figure 3.2).

It is possible that in the situation Miller used (the animal in paralysis and being
shocked) the heart rate would go up due to fear, excitement, tension, or general
arousal. Recognizing this possibility, Miller rewarded some animals when they
showed a decrease in heart rate. These animals decreased their heart rate over a
series of sessions. Hence, the effect cannot be attributed to excitement or arousal.

Figure 3.3 Heart rate changes in rats rewarded for increases and decreases in heart rate,
and heart rates of rats rewarded for intestinal contraction and relaxation. (After Miller,
N. E., and Banuazizi, A. Instrumental learning by curarized rats of a specific visceral
response, intestinal or cardiac. Journal of Comparative and Physiological Psychology,
1968, 65, 1-17. Fig. 4, p. 5. Copyright 1968 by the American Psychological Association.
Reprinted by permission.)

Figure 3.3 indicates that, when animals are rewarded for increases or decreases 
in heart rate, they will show increases and decreases, respectively. But animals 
rewarded for intestinal contraction or relaxation will not alter their heart rate 
(even though they will alter their intestinal activities). This specificity of the 
learning is further evidence for the notion that the experiment is not just increas¬ 
ing the general arousal level of the animal. 

In addition to his work with heart rate and intestinal contraction Miller has 
been able to condition urine formation. He has also reported being able to train a 
rat to blush in one ear but not the other. Keep in mind that the instrumental 
training of so-called involuntary responses argues against the position that in¬ 
strumental and classical conditioning can be distinguished on the voluntary- 
involuntary basis. 

TWO TYPES OF MEDIATION: MUSCULAR AND COGNITIVE. Some investigators have 
suggested that such experiments do not clearly demonstrate instrumental con¬ 
ditioning of involuntary responses. They argue that the effect on the involuntary 
system is mediated, or affected by, voluntary responses. For example, if you were 
offered $5 for every increase you could produce in your pulse rate, you would
quickly earn a lot of money by running around the block. The voluntary running 
response would cause the increase in the involuntary pulse rate. 

Mediation can be either muscular or cognitive. Let us begin with the fact that 
many of our so-called involuntary behaviors can be affected when we engage in 
certain voluntary muscular actions. Running around the block in order to in¬ 
crease one’s pulse rate is an example of this sort of muscular mediation. It has 
been demonstrated in the laboratory, too. For instance, Levenson (1976) reports 
that heart rate can be altered by changing breathing patterns and by engaging in 
certain forms of muscle tension. This represents a case in which we are not 
directly affecting or controlling the heart rate. The effect is mediated by voluntary 
muscular activity. 

Voluntary muscular mediation of changes in involuntary systems remains a 
serious problem in this field. But there is some evidence suggesting that more 
direct, nonmediated effects can be obtained. The first example is Miller’s work 
with curare and rats described above. Some human data are also available. Miller
(1978) and Brucker (1977) describe experiments indicating that
paralyzed humans can learn to control their blood pressure. In these cases, sad as 
they are, voluntary muscular mediation is impossible. 

The second form of mediation can be called cognitive mediation. In brief, we 
can affect certain involuntary responses by thinking certain thoughts. Miller 
(1978) describes two paralyzed human patients who were able to increase their 
blood pressure by thinking about either sex or horse races. Thinking about 
exciting, dangerous, or highly emotional things can affect our so-called involun¬ 
tary responses. To the extent that we assume that thinking is voluntary, we have 
another case, in addition to muscular mediation, in which involuntary activity is 
mediated by voluntary behavior. Even in those cases where muscular mediation is 
eliminated through paralysis, mental mediation can be operating. Thus, the case 
of a pure, incontestable instance of direct, unmediated control of involuntary 
activity still eludes us. 

Apparently we are very clever about mediating control of involuntary re¬ 
sponses. Hatch and Gatchel (1979) report that subjects sometimes begin their 
control of heart rate by thinking arousing thoughts. But when these arousing 
thoughts begin to lose their edge and go flat, the subjects will switch to some form 
of muscular activity to continue the heart rate control (see also Goldstein, 1979).

THE PROBLEM OF REPLICATION. Another problem in this area of research has 
been the progressive decline in curare results obtained by Miller over several years 
(see Figure 3.4). Miller acknowledges this decline, describes it as a vexing phe¬ 
nomenon, and speculates about its causes. He wonders if it might result from 
some subtle form of contamination in our environment or from changes in the 
rats themselves due to extensive breeding practices or antibiotic treatment. He 
himself is skeptical of these results (Miller, 1978). 

Whatever the outcome, Miller has suggested that so-called involuntary
responses may well be affected by the procedures we normally describe as
instrumental. This argues against the voluntary-involuntary distinction between
instrumental and classical conditioning.

Figure 3.4 Progressive decline in amount of learned change in heart rate obtained in 10
experiments over a 5-year period. (Adapted from Miller, N. E. Interactions between
learned and physical factors in mental illness. Seminars in Psychiatry, 1972, 4, 239-254.
Fig. 8, p. 247. Reprinted by permission of Grune & Stratton, Inc., New York, N. Y.)


Biofeedback 

Miller’s work and the development of biofeedback training procedures represent 
some of the most intriguing events of the recent past (Kamiya, Barber, Miller, 
Shapiro, & Stoyva, 1977; Shapiro, Barber, DiCara, Kamiya, Miller, & Stoyva, 
1973). This material is almost like science fiction, and once you read about it, you 
will be able to understand why there has been such an enormous increase in the 
amount of attention paid to these procedures. In biofeedback training humans are 
given information about involuntary changes in their own body (for example, 
changes in heart rate or blood pressure). Normally, the subjects are not aware of 
these changes, which are often thought to be beyond voluntary control. For 
example, if you try to increase your blood pressure, you may decide that it is 
beyond your control. In the biofeedback procedure the experimenter rewards the 
subjects for responses of a certain amplitude or duration. Just as was the case with 
Miller’s rats, the human subjects alter their so-called involuntary responses. For 
example, subjects may hear a click each time their heart beats. The experimenter 
may then reward increases or decreases in the click rate. The heart rate varies a 
little, spontaneously. The experimenter watches these fluctuations and rewards 
the ones he or she chooses to reward. The reward may be just about anything 
(money, praise, avoidance of shock). Over time the subjects’ heart rate moves in 
the rewarded direction. 
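
The reward rule in this procedure can be sketched as a simple decision made on each heart rate reading. The text says only that the experimenter rewards the spontaneous fluctuations he or she chooses; the specific criterion used below (a reading counts if it moves in the chosen direction relative to the average of the last few readings) is an assumption made for illustration, as are the function name, the `window` parameter, and the sample readings.

```python
def rewarded_readings(rates, direction="increase", window=3):
    """Flag which heart rate readings would earn a reward.

    rates: successive heart rate readings in beats per minute (invented data).
    direction: "increase" or "decrease", the change the experimenter rewards.
    window: how many preceding readings form the baseline (an assumption).
    Returns a list of booleans, one per reading.
    """
    if direction not in ("increase", "decrease"):
        raise ValueError("direction must be 'increase' or 'decrease'")
    flags = []
    for i, rate in enumerate(rates):
        recent = rates[max(0, i - window):i]
        if not recent:
            # No baseline yet, so the first reading is never rewarded.
            flags.append(False)
            continue
        baseline = sum(recent) / len(recent)
        if direction == "increase":
            flags.append(rate > baseline)
        else:
            flags.append(rate < baseline)
    return flags


readings = [70, 72, 69, 71, 74]
print(rewarded_readings(readings, "increase"))
print(rewarded_readings(readings, "decrease"))
```

The sketch covers only the experimenter's side of the procedure. Whether the subject's heart rate then drifts in the rewarded direction is the empirical question the biofeedback studies address.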

This kind of experimental investigation of the control of involuntary systems 
was predated by many practices observed in Eastern cultures. There have been 
many reports suggesting that meditation, as practiced by yoga and Zen medita¬ 
tors, can have strong effects on physiological activities presumed to be beyond 
voluntary control. For example, Wallace and Benson (1972) describe studies in 
which Zen meditators dramatically decreased their oxygen consumption and 
carbon dioxide output during meditation. Green (1973) reports a case in which a 
swami caused his heart to cease pumping blood for 17 seconds. Western science
and technology are just beginning to explore events that have long been known to
Eastern meditators, who believe “body” to be a physical projection of “mind.” 

Quite a few so-called involuntary responses have been brought under
instrumental control. They include, among others, heart rate, blood pressure,
electrical activity of the brain (Plotkin, 1979), skin temperature, salivation,
vomiting, and electrodermal activity (Blanchard & Young, 1974; Kamiya et al., 1977;
Shapiro & Schwartz, 1972).

APPLICATION OF BIOFEEDBACK TECHNIQUES. If the research pays off, biofeed¬ 
back techniques may be used in many different ways (Bird, Catoldo, & Barker, 
1981; Kallman & Gilmore, 1981). Consider psychosomatic illnesses, in which 
physical symptoms appear when there are no identifiable underlying physical 
causes. For example, imagine an individual in a state of constant gastric distress, 
with physicians unable to find any physical basis for the disorder. Biofeed¬ 
back research suggests that the symptoms have been learned. The gastric distress 
symptoms may be “involuntary” responses that have been instrumentally reward¬ 
ed: every time the gastric symptoms appear, the patient is allowed to stay home 
from the office. A child may instrumentally record a fever if fever is consistently 
rewarded by attention from a set of worried parents. A soldier may develop a true
paralysis if the condition allows him or her to avoid combat. Unknowingly, these
individuals may have undergone biofeedback training. The so-called involuntary 
illnesses may have been brought under instrumental control. 

Although there is little hard evidence at present, it is hoped that these kinds of 
response may be extinguished by removing the reinforcers, just as a rat ceases to 
press the bar when reinforcement is removed. If the child with the fever is not 
given reinforcing attention, the fever may well subside. It really makes no differ¬ 
ence whether the changes are mediated (muscularly or mentally) or not. As long 
as some good is done, the technique must be considered useful. 

Biofeedback techniques have already been used in connection with a wide 
range of disorders, including high blood pressure, tension headaches, migraine 
headaches, epilepsy, gastrointestinal disorders, asthma, neuroses, anxiety, insom¬ 
nia, bruxism (grinding of the teeth), and enuresis (bed-wetting) (Miller, 1978; 
Sargent, Walters, & Green, 1973). Relaxation responses have also been tied to 
biofeedback techniques (Tarler-Benlolo, 1978). 

People suffering from a rare condition called Raynaud’s disease (Keefe, Surwit, 
& Pilon, 1980) can be helped by biofeedback too. People with this disease suffer 
from painful spasms that block the flow of blood in the fingers and toes. When the 
spasm ends, further pain is caused by the abrupt restoration of the blood flow. But 
through biofeedback training people can learn to raise their skin temperature, 
which helps reduce the severity and the occurrence of the spasms. 

The possibilities are almost unlimited. The literature of this new and controver¬ 
sial field covers a wide range of concerns from the scientific to the mystical. 
Whatever the eventual outcome of all this activity, biofeedback is a new and 
challenging area for study. 

BEHAVIORAL MEDICINE. Biofeedback belongs to, or is one part of, a growing 
field called behavioral medicine. In the past, psychology has been related to the 
medical field of psychiatry and has not extended beyond that area. New
psychological data and theories are beginning to be related to a much larger 
segment of the medical field, including concern for stress, psychosomatic symp¬ 
toms, the immune system, and other diseases. The mind and the body are turning 
out to be much more closely related than some had imagined (Miller, 1976; 
Williams & Gentry, 1977). In behavioral medicine psychological techniques are 
used to treat such things as high blood pressure, headaches, overeating, smoking, 
and drinking (Masek, Epstein, & Russo, 1981). 

Summary of the Comparison 

The three distinctions between classical and instrumental conditioning mentioned 
in the preceding pages are not the only ones that have been proposed. But they are 
perhaps the most important ones. The clarity of these distinctions has begun to 
fade, and with new thinking and research it seems that at the very least one must 
conclude that instrumental and classical conditioning are very closely related. 
Some thinkers (Bindra, 1972) have gone as far as to assert that classical and 
instrumental conditioning are the outcomes of some fundamental underlying 
process. 

PROCEDURAL COMPARISONS. To make sense of the array of learning tasks it is 
useful to distinguish three ways in which learning tasks can be compared. If we 
merely look at the procedures involved in the various learning tasks, it is clear 
that they do differ. No one would deny that classical procedures differ from 
instrumental procedures. In instrumental conditioning a reward is given only 
when the animal responds, whereas in classical conditioning the experimenter 
presents the CS and UCS whether the animal responds or not. But we have 
already seen that what the experimenter thinks is important may or may not be 
important to the animal. Despite the insistence that events in classical condition¬ 
ing are independent of the animal’s response, we have seen that they are not. 
Thus, if we are to discover true differences among the various learning situations, 
we must look beyond these superficial procedural differences. 

COMPARISONS AT THE NEUROLOGICAL LEVEL. Learning tasks can be compared 
in terms of their location in the nervous system. If one task involves, or is 
associated with, action in one part of the nervous system, then it can be distinguished
from other types of learning that are to be found in other levels of the
nervous system. Yet it has proved difficult to separate learning tasks on this basis.
The traditional distinction between involuntary classical conditioning and voluntary
instrumental conditioning has broken down. Instrumental conditioning has
been heavily implicated in those aspects of the nervous system (for example, the 
autonomic nervous system) that were once presumed to be the domain of classical 
conditioning alone. 

BEHAVIORAL COMPARISONS. In view of the fact that procedural distinctions may 
be superficial and neurological distinctions difficult to maintain, we must look for 
other ways to try to compare various learning tasks. For example, we can 
determine whether the tasks react in similar ways to the same manipulations. Say, 
for example, that we are faced with two children and want to find out how similar
they are. We might begin by taking something from each of them. If they both
scream and yell, look hurt, or react with scorn, then we have some evidence for
the idea that, at least in some ways, the children are similar. The more similarities
we can find, the more common responses the two children have in similar
situations, the more likely we are to believe that they are similar in some
fundamental ways.

Figure 3.5 Performance curves, real and fanciful.

The same kind of comparison technique can be applied to learning tasks in an 
effort to answer the following types of question: How are various tasks affected 
by the same important variables (for example, amount of practice and external 
distractions)? Do they all show similar increases, or decreases, in performance? 
Or are they totally different in their responses to a given variable? To the extent 
that they react similarly to a wide range of variables, we have some evidence for 
the conclusion that they are similar, or operate according to the same behavioral 
laws. 


COMMON PHENOMENA: SIMILARITIES AMONG TASKS 

Acquisition 

What happens to performance as practice continues? The simplest answer is that 
performance increases as practice increases. But further questions can be asked. 
For example, what is the specific form of the relationship between practice and 
performance? Does performance increase slowly at first and then more rapidly as 
practice increases? Or does performance start out rapidly and then level off? 
Figure 3.5 suggests some of the relationships, both reasonable and fanciful, that 
might exist between practice and performance. Without some advance knowledge 
it is difficult to decide which of these curves represents the general relationship 
between performance and practice. 

One might also wonder whether the relationship might differ for different types 
of learning task. For example, are word-list recall, classical conditioning, and
instrumental conditioning represented by the same curve? Figure 3.5c represents
the curve most often obtained with all three of these tasks. As practice, or the 
pairing of the CS and the UCS, increases in classical conditioning, the probability 
of the CR increases quite rapidly and then levels off (see Figure 3.6a). Similarly, 
the most common finding in instrumental learning is that the response builds up 
rapidly and then levels off (see Figure 3.6b). And finally, if subjects are shown a 
list of words 20 times and asked to recall as many of them as they can after each 
exposure, their performance also shows an initial spurt and then a leveling off (see 
Figure 3.6c). 

This characteristic form is termed a negatively accelerated performance curve,
which means that increases in performance start out rapidly and then level off. 
The fact that the negatively accelerated performance curve is characteristic of 
many learning tasks is a tiny bit of evidence for the conclusion that the tasks are 
similar. That they yield similar curves in response to the same variable (practice) 
suggests that they might possibly operate according to the same behavioral laws. 
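
The shape of such a curve can be made concrete with a small sketch. It assumes a simple rule that is not proposed in the text: on every trial, performance closes a fixed fraction of the remaining gap between its current level and a ceiling. The names `performance_curve`, `rate`, `ceiling`, and `start` are hypothetical and chosen only for illustration.

```python
def performance_curve(trials, rate=0.3, ceiling=1.0, start=0.0):
    """Return the performance level after each practice trial.

    Assumed rule (illustrative only): each trial moves performance a
    fixed fraction `rate` of the remaining distance toward `ceiling`.
    """
    levels = []
    level = start
    for _ in range(trials):
        level += rate * (ceiling - level)  # the gain shrinks as the gap closes
        levels.append(level)
    return levels


curve = performance_curve(10)

# Trial-to-trial gains: large at first, then smaller and smaller.
gains = [after - before for before, after in zip([0.0] + curve[:-1], curve)]

print([round(level, 3) for level in curve])
print([round(gain, 3) for gain in gains])
```

Under this assumed rule the early trials produce the largest gains and the later ones progressively smaller gains, which is the pattern the term "negatively accelerated" describes. Other rules could produce the same general shape.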

Three points should be kept in mind. First, the negatively accelerated curve is 
only characteristic of these tasks—it does not always appear. It is the most 
commonly observed curve but not the only one that can be obtained. On the one 
hand, a PA list can be constructed that is so easy it will be learned on the first trial 
(composed, for example, of pairs such as heigh-ho, one-two, and black-white).
On the other hand, PA lists can be developed that are next to impossible to master
(composed, for example, of such pairs as XZKQZZKXO-QKQXQXZKX,
QZKXKZXQX-QXKKZQZKX, and XZKZQKZXQ-XKKQXZQKQ).
Second, keep in mind that these are not learning curves. They are performance 
curves. If you recall, performance refers to the translation of learning into 
behavior through the involvement of motivation. All we can observe is performance,
or behavior, and this is affected by motivation as well as learning. Third,
remember that we have been selective in comparing only a few learning tasks. If it 
is found that they react in similar ways to a wide range of variables, we must still 
be careful about wanting to conclude that all types of learning task are similar. 

Extinction 

Extinction refers to a decrease in response strength with repeated nonreinforcements.
If food is no longer given to a rat after the animal presses a bar, it will
gradually cease to press. A similar effect will also appear in classical conditioning. 
One of the most common findings in classical conditioning is that, if the UCS is 
removed from the situation, the CR will fade. If a bell (CS) is presented over and 
over again without an air puff (UCS), the blink (CR) will lessen and finally cease 
altogether. Extinction appears in various verbal and cognitive tasks, too. For 
example, we will learn in Chapter 8 how paired-associate learning can be extinguished.
We will find that, if a subject has first connected BAV to JAX and is then
no longer reinforced for giving BAV as the response, the probability of JAX’s 
eliciting BAV will decrease. 

Thus, extinction-like effects appear in many tasks. Again, in hindsight, this is 
far from astounding. But things might have turned out differently. Classically 
conditioned responses might have been permanent. Instrumentally conditioned 
responses might have been obliterated by the mere passage of time rather than 
through this active extinction process. There are any number of things that might
have been true. But the fact that they show similar extinction effects adds to the
argument that they are all operating according to the same laws.

Figure 3.6 Actual data obtained in (a) classical, (b) instrumental, and (c) word-list recall
situations. Notice that all curves are negatively accelerated and resemble the hypothetical
curve in Figure 3.5c. [(a) Adapted from Kehoe, E. J. The role of CS-UCS contiguity in
classical conditioning of the rabbit’s nictitating membrane response to serial stimuli.
Learning and Motivation, 1979, 10, 23-28. Fig. 1, p. 28; (b) adapted from Roberts,
W. A. Spatial memory in the rat in a hierarchical maze. Learning and Motivation, 1979,
10, 117-140. Fig. 7, p. 128; (c) unpublished data gathered by author.]

Extinction is an interesting and controversial phenomenon. In a sense, it means 
that things are not forgotten passively. If the bar-pressing rat is left alone in its 
home cage, away from the bar, it will retain the bar-pressing behavior for long 
periods. The mere passage of time does little to reduce the response; for it to be 
erased the animal must make the response over and over again and not be 
reinforced. This view is difficult to reconcile with our subjective feelings about 
how we forget things. Though our memories just seem to fade with time, the 
literature on extinction suggests that humans may not forget things passively. 
When something is forgotten, it may undergo an active extinction process in 
which the responses are made but not reinforced. Although this proposition is 
extremely controversial, there is evidence suggesting that much of our forgetting 
is not passive. There may be a good deal of correspondence between experimental 
extinction, as observed in the Skinner box, and normal human forgetting. The 
relationship between animal extinction and human forgetting will be discussed in 
a later chapter, where alternative views of human memory will be presented. 

EXTINCTION AS ACQUISITION. Extinction can be thought of as nothing more 
than an acquisition process. For example, when the rat’s bar-pressing response is 
extinguished, this situation may be envisioned as one in which the animal actually 
learns not to respond or acquires the response of not responding. Human subjects 
may have to learn not to say “BAV” when they see JAX. According to this 
interpretation, the organism’s response is not just dropped out owing to nonreinforcement.
The animal is actually learning something new. It is learning not to
make the response. 

INSTRUMENTAL SITUATIONS: TANTRUMS. The principle of extinction has often 
been used in behavior-modification situations to eliminate undesirable instrumentally
conditioned responses. For example, Williams (1959) made the
assumption that the temper-tantrum behavior of a 21-month-old boy was a 
learned behavior that was being unknowingly rewarded by the parents. Each time 
the boy threw a tantrum at bedtime, the parents became greatly concerned; this 
concern was rewarding to the child, because it delayed bedtime and gained 
attention from the parents. If we assume that the behavior is maintained because 
it is rewarded, we go about eliminating it in the same way we extinguish bar 
pressing in a Skinner box. We remove the reinforcement. We put the individual on 
an extinction schedule. We assume that the boy’s tantrum is being reinforced by 
“getting his own way” or by his receiving attention from his parents when 
tantrum behaviors are shown. Eliminate these rewards, and the behavior should 
subside. This was what Williams demonstrated. The behavior was terminated by 
gently putting the child to bed, leaving the room, and closing the door. The first 
night the screaming and raging lasted 45 minutes before the child finally dropped 
off to sleep. (That amount of storming would cause all but the most carefully 
instructed parent to break down and enter the room.) As can be seen in Figure 3.7, 
the length of the tantrums dropped dramatically on successive nights. The curve 
labeled “First extinction” represents the initial series of nights without social
reinforcement or attention. After this first extinction, the child’s aunt began
taking care of the child in the absence of the parents. The tantrums started again,
and the aunt fell into the trap of staying in the room until the child fell asleep. The
tantrum behavior had to be extinguished all over again (see “Second extinction”
in Figure 3.7). According to the author, following the second extinction no
further tantrums were reported during a 2-year follow-up.

Figure 3.7 Duration of crying as a function of the number of times the child was put to
bed. (Adapted from Williams, C. D. The elimination of tantrum behavior by extinction
procedures. Journal of Abnormal and Social Psychology, 1959, 59, 269. Fig. 1, p. 269.
Copyright 1959 by the American Psychological Association. Reprinted by permission.)

CLASSICAL SITUATIONS: SYSTEMATIC DESENSITIZATION. We have already learned 
in Chapter 1 that phobias, or strong irrational fears, may be the result of classical 
conditioning. If they are, then we should be able to extinguish them. One of the 
best-known forms of extinction therapy is that of systematic desensitization 
(Wolpe, 1958, 1969). Some individuals find themselves inordinately frightened by 
certain types of stimuli and situations (death, illness, social contacts, being alone, 
injections, fear of losing their mind, pregnancy, medicines, fainting). These anxieties,
fears, phobias, or whatever you wish to call them can be quite incapacitating.
That is, the critical stimuli elicit such a strong fear reaction that the subject is
unable to function as well as he or she might in everyday life. 

Systematic desensitization involves two steps. In the first phase the subject is 
taught to relax. Relaxation responses can be encouraged in a number of ways. 
One technique asks the subject to alternately tighten and relax specific muscles. 
For instance, “Tighten or clench the muscles in your right arm. Now relax them.” 
Through this process we can learn what it feels like to relax. We can learn to 
recognize, and establish, a relaxed condition (see Jacob, Turner, Szekely, &
Eidelman, 1938).

Having trained the subject to relax, the therapist moves on to the second phase
of systematic desensitization, which deals directly with the fears, or the phobic
reactions. The object of desensitization is to eliminate these fears. The fear-producing
stimuli are thought to be conditioned stimuli (CSs). They have, sometime
in the past, been paired with a UCS, which elicits a fear reaction (UCR). The
fear the subject experiences in response to the CS is a conditioned response (CR),
or a conditioned fear. We extinguish a CR by presenting the CS alone, over and
over again, without the UCS. With one major variation this same extinction
procedure forms the basis of desensitization. In initial interviews the therapist 
asks the subject to list fear-producing stimuli and to rank them in terms of the 
strength of the fear they produce. Thus, a hierarchy of fear-producing stimuli and 
situations is produced by the subject. The therapist then asks the subject to think 
about, to dwell on, and to imagine the weakest of these stimuli or situations. The 
subject is instructed to relax and to imagine this weak fear-producing situation 
until it no longer elicits any fear at all. Essentially this is an extinction procedure. 
The CS is present without the UCS. 

At the same time, it is a little more than a simple, pure extinction procedure, 
because the subject is encouraged to relax in the presence of the CS. In a sense, the 
relaxation response is assumed to become conditioned to the CS while the fear 
response is extinguished. This process, in which one CR is extinguished while 
another is established, is called counterconditioning. The fear response is replaced 
by the relaxation response (see Capaldi, Viveiros, & Campbell, 1983). 

Once the fear of the weakest stimulus has subsided and the subject has become 
able to relax in its presence, the subject goes on to the next most potent stimulus 
in the hierarchy. By gradually moving up the hierarchy and by extinguishing fear 
at each step and replacing it with relaxation, the subject is able to overcome fear 
of even the most frightening stimuli. Sample hierarchies for claustrophobia, 
death, and illness are given in Table 3.1. Hierarchies, of course, differ from 
individual to individual. 
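
The stepwise logic of moving up a hierarchy can be sketched in code. This is a toy illustration under loud assumptions, not a clinical model: the fear ratings attached to the three hierarchy items are invented, and the rule that each relaxed exposure halves the remaining fear (`decay`) and the stopping point (`threshold`) are arbitrary. The function name `desensitize` is hypothetical.

```python
def desensitize(hierarchy, decay=0.5, threshold=0.05):
    """Walk up a fear hierarchy, weakest item first.

    `hierarchy` maps each item to a starting fear rating between 0 and 1.
    The ratings, the halving of fear on every relaxed exposure (`decay`),
    and the stopping `threshold` are made-up values for illustration.
    Returns a list of (item, number_of_exposures) in the order treated.
    """
    log = []
    # Start with the weakest fear and work upward.
    for item, fear in sorted(hierarchy.items(), key=lambda pair: pair[1]):
        exposures = 0
        while fear > threshold:
            fear *= decay  # CS imagined while relaxed; no UCS follows
            exposures += 1
        log.append((item, exposures))
    return log


# Items taken from the claustrophobic series; the numbers are invented.
example = {
    "being stuck in an elevator": 0.9,
    "being locked in a room": 0.8,
    "reading of miners trapped underground": 0.2,
}

log = desensitize(example)
for item, exposures in log:
    print(f"{item}: {exposures} exposures")
```

The only point of the sketch is the ordering: the subject does not move to a stronger item until fear of the current one has dropped below the chosen threshold, and stronger items take at least as many exposures under this assumed rule.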

Although phobias and anxiety have often been combated with the systematic 
desensitization procedure (see Donovan & Gershman, 1979), it should be noted 
that many additional procedures have also been used in trying to deal with these 
thorny problems (see Emmelkamp, 1979; Ohman, 1979). 

It should also be noted that extinction procedures have been applied to more 
than just tantrums and phobias. Similar extinction procedures have been effective 
with, among others, deviant sex-role behaviors (Rekers & Lovaas, 1974), passivity
(Johnston, Kelley, Harris, & Wolf, 1966), aggressive behavior (Allen, Henke,
Harris, Baer, & Reynolds, 1967), vomiting (Munford & Pally, 1979), stomach 
pain (Miller & Kratochwill, 1979), and hair pulling (Sanchez, 1979). 

Spontaneous Recovery 

Spontaneous recovery refers to the fact that an extinguished response will, with
rest, recover some of its strength. The general procedure for demonstrating
spontaneous recovery involves four steps (Figure 3.8). First, the animal learns a
particular response. Second, that response is extinguished through nonreinforcement.
Third, the animal is allowed to rest. Fourth, the strength of the response is
tested following the rest interval. If the response is stronger at the end of the rest
phase than it was at the end of the extinction phase, then spontaneous recovery is
said to have occurred. After the response has recovered, it may be brought back to
full strength through further reinforcement, or it may be reextinguished through
nonreinforcement.

Table 3.1 Anxiety Hierarchies

A. Claustrophobic Series
1. being stuck in an elevator (the longer the time, the more disturbing)
2. being locked in a room (the smaller the room and the longer the time, the more disturbing)
3. passing through a tunnel in a railway train (the longer the tunnel, the more disturbing)
4. traveling in an elevator alone (the greater the distance, the more disturbing)
5. traveling in an elevator with an operator (the longer the distance, the more disturbing)
6. on a journey by train (the longer the journey, the more disturbing)
7. being caught in a dress with a stuck zipper
8. having a tight ring on her finger
9. visiting and unable to leave at will (for example, if engaged in a card game)
10. being told of somebody in jail
11. having polish on her fingernails and no access to remover
12. reading of miners trapped underground

B. Death Series
1. being at a burial
2. being at a house of mourning
3. the word death
4. seeing a funeral procession (the nearer, the more disturbing)
5. the sight of a dead animal (for example, a cat)
6. driving past a cemetery (the nearer, the more disturbing)

C. Illness Series
1. hearing that an acquaintance has cancer
2. the word cancer
3. witnessing a convulsive seizure
4. discussions of operations (the more prolonged the discussion, the more disturbing)
5. seeing a person receive an injection
6. seeing someone faint
7. the word operation
8. considerable bleeding from another person
9. a friend points to a stranger, saying, “This man has tuberculosis.”
10. the sight of a blood-stained bandage
11. the smell of ether
12. the sight of a friend sick in bed (the more sick looking, the more disturbing)
13. the smell of methylated spirits
14. driving past a hospital

Adapted from Wolpe, J. The systematic desensitization treatment of neuroses. Journal of Nervous and Mental Disease, 1961, 132,
189-203. P. 197. Copyright 1961 by The Williams & Wilkins Co., Baltimore. Reprinted by permission.

This phenomenon appears in verbal, classical, and instrumental conditioning 
situations. Pavlov (1927) reports experiments of the following sort. Dogs were 
classically conditioned to salivate. Then the CS was presented alone for a series of 
extinction trials. These extinction trials reduced the number of drops of saliva 
secreted in response to the CS from 10 to 3. The animal was then allowed to rest 
for approximately half an hour. At the end of the rest interval the CS was 
presented alone, and 6 drops of saliva were secreted. The salivary response had 
spontaneously recovered some of its strength. Characteristically, recovery is not 
complete. The response does not usually return to the highest level obtained 
during the acquisition phase. 
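
The size of the recovery in the example above can be expressed as a fraction of what extinction removed. The sketch below is one possible way to summarize it, not a measure taken from Pavlov; the function name `recovery_fraction` is hypothetical, and it treats the 10 drops at the start of extinction as the acquired strength, which is an assumption.

```python
def recovery_fraction(acquired, extinguished, after_rest):
    """Fraction of the strength lost during extinction that returns after rest."""
    lost = acquired - extinguished
    if lost <= 0:
        raise ValueError("extinction must have reduced the response")
    return (after_rest - extinguished) / lost


# Drops of saliva in the example above: 10 before extinction,
# 3 at the end of extinction, 6 after the rest interval.
fraction = recovery_fraction(acquired=10, extinguished=3, after_rest=6)
print(round(fraction, 2))  # 0.43
```

A value of 1.0 would mean complete recovery. Here fewer than half of the lost drops return, in line with the observation that recovery is characteristically incomplete.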

Lewis (1956) has demonstrated spontaneous recovery of an instrumentally 
conditioned response. He first trained rats to run down a straight alley to food. 
He extinguished the response and then allowed the rats to rest. At the end of the 
rest interval Lewis observed that the extinguished running response had spontaneously
recovered some of its strength.

Figure 3.8 Schematic representation of the course of acquisition, extinction, and
spontaneous recovery.

Spontaneous recovery of verbal material has been shown, too (Birnbaum, 
1965; Houston, 1966b; Slamecka, 1966). The general procedure has been to 
extinguish a learned list and then to observe whether the extinguished, or forgotten,
list will recover some of its strength with time (Postman, Stark, & Fraser,
1968; Saltz, 1965; Silverstein, 1967). The details of this procedure will be considered
in a later section. For now, it is sufficient to say that the “forgotten”
associations do seem to recover some of their strength with rest. 

In summary, acquisition, extinction, and spontaneous recovery all seem to be 
common to many learning tasks. This suggests that, at least on a behavioral level, 
the tasks are closely related. 

External Inhibition 

External inhibition refers to the fact that a new stimulus introduced during 
acquisition, along with the CS, will slow the acquisition process. If a buzzer is 
suddenly sounded along with the CS in eye blink conditioning, the conditioning 
will be slowed. The same kind of effect can be obtained in instrumental conditioning.
For example, Winnick and Hunt (1951) trained rats to run down an alley to
obtain food at the end. A buzzing sound introduced at the beginning of a trial 
slowed the rat’s performance dramatically. 

Few verbal or cognitive tasks deal directly with external inhibition, but it is a 
simple matter to imagine what would happen if you were trying to learn a list of 
Greek words and someone sounded a buzzer out of the blue. You would be 
externally inhibited, to say the least. 

External inhibition can be thought of as a distraction effect. Any change in the 
conditioning situation that distracts the subject from the task will reduce performance
accordingly.

Subjects tend to adapt to external inhibitors. The disruptive quality of an 
external inhibitor is temporary. After a number of trials in which the buzzer is 
presented along with the CS, performance returns to its original level.

Figure 3.9 The shadowing effect. (From Cognition by Margaret Matlin. Copyright ©
1983 by CBS College Publishing. Reprinted by permission of Holt, Rinehart and 
Winston, CBS College Publishing.) 


SHADOWING. Some studies in cognitive psychology resemble, if they don’t 
actually duplicate, the external inhibition effect. For example, Treisman (1960) 
put earphones on her subjects and delivered different messages to each ear. 
Subjects were told to attend only to, say, the right ear and to ignore what was 
coming into the left ear (see Figure 3.9). Furthermore, they were asked to say out 
loud what they had heard in the right ear; that is, they were asked to “shadow” 
the right-ear message. Treisman found that, although subjects were pretty good at 
ignoring the left-ear message (an external inhibitor?), they occasionally became
distracted and repeated what they had heard through the left ear rather than what 
they had heard on the right side, as Figure 3.9 illustrates. Performance was 
disrupted by an external stimulus. 

Stimulus Generalization 

Stimulus generalization refers to the fact that a response conditioned to one 
stimulus will tend to be elicited by other similar stimuli. The greater the difference 
between the conditioned stimulus and the test stimulus, the less is the tendency for 
the test stimulus to elicit the response. 

For example, we have all learned to stop our car when we see round, red lights. 
Suppose that we were faced with a red light that was not quite so bright and not 
quite so red as the ones we were used to. We would probably stop anyway. What 
if the light were very dim or square? Our stopping response might not be so 
salient. Or if the light were pink and oblong? Or if it were purple and 1 foot off 
the ground? We can see that, as the similarity between the new, test stimuli and 
the original, round, red stimulus decreases, our generalized tendency to stop will 
decrease. 

Stimulus generalization has been one of the main concerns in the field of
learning, and thousands of studies have been published concerning it. The importance
of stimulus generalization in our own life is undeniable. Suppose for the
moment that you cannot generalize. If you connect a response to a given stimulus,
then only that stimulus will elicit that response. Similar stimuli will not elicit the
response. Your life will probably be dangerous and confusing. We need to
generalize in order to get along in the world. Otherwise we would run red lights in
strange towns and have trouble identifying strangers as people.

Figure 3.10 Stimulus-generalization gradient for pigeons that were trained to peck in
the presence of a colored light of 580 nm wavelength and then tested in the presence of
other colors. (From Guttman, N., and Kalish, H. I. Discriminability and stimulus
generalization. Journal of Experimental Psychology, 1956, 51, 79-88. Copyright 1956
by the American Psychological Association.)

Stimulus generalization has often been demonstrated in classical conditioning 
(Moore, 1972). In general, a two-step experimental procedure is used. First, a CR 
is connected to a particular CS (for example, a given tone). Second, after learning 
is completed, similar stimuli (varying tones) are presented. It is usually found that 
the probability of the response to the varying test stimuli increases as the similarity
between the test stimuli and the original stimulus is increased.

Guttman and Kalish (1956) have demonstrated stimulus generalization in 
instrumental conditioning. Their general procedure has been to train pigeons 
to peck a colored disk. The pecking response is rewarded with food. After 
the response is well established, the pigeons are tested with other colors. As 
the similarity between the training and the test stimuli increases, the response to 
the test stimuli increases. This effect is summarized by the stimulus generalization 
gradient depicted in Figure 3.10. 
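
A generalization gradient of this kind can be sketched as a function that falls off with distance from the training stimulus. The bell-shaped (Gaussian) form, the `width_nm` value, and the function name `generalization_gradient` are all assumptions made for illustration; they are not fitted to the data in Figure 3.10. Only the 580 nm training wavelength comes from the figure caption.

```python
import math


def generalization_gradient(test_nm, trained_nm=580.0, width_nm=15.0, peak=1.0):
    """Relative response strength to a test wavelength.

    Assumes a bell-shaped falloff around the trained stimulus. The shape
    and `width_nm` are illustrative choices, not values from the study.
    """
    distance = test_nm - trained_nm
    return peak * math.exp(-(distance ** 2) / (2 * width_nm ** 2))


# Response is strongest at the trained wavelength and weaker on either side.
for wavelength in (550, 570, 580, 590, 610):
    print(wavelength, round(generalization_gradient(wavelength), 2))
```

Whatever the exact shape, the defining property is the one stated in the text: the response to a test stimulus shrinks as that stimulus becomes less similar to the training stimulus.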

Morosko and Baer (1970) describe an interesting case of stimulus generalization.
Alcoholic subjects were shocked while ingesting small cups of alcoholic
beverages. They learned to avoid the shock by refraining from drinking the 
beverages. Follow-up work indicated that the avoidance behavior generalized 
across alcoholic beverages. When subjects were shocked for drinking vodka, they 
showed a decreased interest in drinking other forms of liquor as well. But as we 
would expect from our knowledge of stimulus generalization gradients, the effect 
was only partial; that is, the trained variety was the most aversive. The subject 
shocked for drinking vodka continued to drink such beverages as beer, but in 
reduced amounts.

As we have seen, there is another interesting experiment that suggests that
stimulus generalization operates in connection with verbal materials. In semantic 
conditioning a response is connected to a verbal item (for example, a word). 
Varying test words are then presented to the subject. It is usually noted that the 
response is elicited by similar test stimuli (Maltzman, 1977). In one experiment 
Lang, Geer, and Hnatiow (1963) presented 12 very hostile words (such as annihilation)
along with some neutral ones (abstract). Each time a hostile word was
presented, the subject was shocked. The shock yielded a change in the galvanic 
skin response (GSR). (The GSR is an index of the electrical conductivity of the 
surface of the skin. In other words, when we are shocked, we perspire a little. The 
perspiration conducts electricity.) After the training period, subjects were presented
new test words that were highly hostile, hostile, of low hostility, or neutral.
The authors found that, the higher the hostility of the test words, the greater was 
the change in the GSR. The GSR had generalized from the hostile training words 
to the similar hostile test words. 

We will return to stimulus generalization in Chapter 7. For now, realize that it is 
a phenomenon that occurs in many learning tasks. It is one more piece of evidence 
for the conclusion that learning tasks are behaviorally related. 

Discrimination

One of the situations illustrating discrimination learning that was described 
earlier involved a hungry pigeon and two disks of different colors. One disk was 
called the positive stimulus, and the other, the negative stimulus. If the pigeon 
pecked the positive stimulus, it received a reward. If it pecked the negative 
stimulus, it received no reward. The bird gradually discriminated between the 
colors and restricted its response to the positive stimulus. These events constitute 
an instance of discrimination learning in an instrumental conditioning situation. 
The same kind of discrimination formation occurs in classical conditioning. If one 
stimulus (for example, a buzzer) is consistently paired with a UCS while another 
stimulus (a light) is presented but not accompanied by the UCS, then a CR 
gradually develops in connection with the buzzer but not the light. The animal 
discriminates between the two potential CSs and only responds to the one that is 
paired with the UCS. 

Of course, cognitive and verbal tasks also involve discrimination formation. 
For example, many discriminations are required before a PA list can be mastered. 
Suppose that the stimuli in a PA list are very similar to one another (for example, 
QZKL, KLZQ, QZLK, KLQZ, KZQL, and QLZK). Before subjects can connect 
the proper responses to these stimuli, they must distinguish one from another. 
They all look pretty much alike at first, but continued exposure allows the 
subjects to discriminate among them. Thus, just as in classical and instrumental 
conditioning, discrimination formation is an integral part of verbal and cognitive 
learning processes. 


Summation 

Assume that a given response is separately connected to two different stimuli. If 
the two different stimuli are then presented together, the strength of the response 
is often greater than it is to either of the stimuli alone. This effect is called
summation, and it can be demonstrated in many learning situations. Pavlov
conducted experiments in which a salivary response was first connected to a given
CS. The salivary response was then conditioned to a second, different CS. When 
the two CSs were presented together, the amount of saliva produced was greater 
than that produced by either CS alone. The same kind of effect can be obtained in 
instrumental conditioning. For example, one might run an experiment in which a 
rat is trained in a T maze. The rat might first be required to run right for food if a 
buzzer is sounded and left if no buzzer is sounded. It might then be trained to turn 
right if a light is turned on and left if the light is not turned on. In the final step 
both the light and the buzzer would be presented. If the tendency to turn right in 
response to both the stimuli is greater than it was to either stimulus alone, then we 
would conclude that summation had been demonstrated. 
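
The decision rule in the T-maze example can be written out as a simple comparison. The sketch below is only an illustration: the function name `shows_summation` is hypothetical, and the proportions of right turns are invented numbers, not data from any experiment.

```python
def shows_summation(response_to_a, response_to_b, response_to_compound):
    """True when the compound response exceeds the response to either stimulus alone."""
    return response_to_compound > max(response_to_a, response_to_b)


# Invented proportions of right turns, for illustration only.
buzzer_alone = 0.70
light_alone = 0.65
buzzer_and_light = 0.85

print(shows_summation(buzzer_alone, light_alone, buzzer_and_light))
```

The comparison is made against the stronger of the two single-stimulus responses, since the definition requires the compound to exceed the response to either stimulus alone.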

Hill and Wickens (1962) ran an experiment that demonstrated summation in 
PA learning. In part of their study, subjects first learned PA lists in which the 
stimuli were nonsense syllables and the responses were words. They then learned 
a second list in which the responses were the same words but the stimuli had been 
changed to colors. In the final phase the colors and the nonsense syllables were 
presented together. The authors noted that a definite advantage had been gained 
by having been trained with each of the different stimuli alone. 

CONCLUSION 

Even with all this discussion of similarities and differences it has not been 
conclusively demonstrated that these and other types of learning task represent 
fundamentally distinct learning processes. It is probably premature to conclude 
either that they do or that they do not represent truly different kinds of learning. 

Eventually it may be possible to develop a unitary model that will account for 
all these task variations. Obviously, the problem is that there are so many 
different learning situations that it is extremely difficult to imagine a single model 
that would encompass them all. In addition, the similarities we have discussed 
may be more apparent than real—they may be the result of a lack of precision in 
our language rather than any identity of basic process. But it was not the intent of 
this chapter to prove that the many learning tasks definitely represent different 
forms of some unitary, underlying process. Rather, it has been suggested that such 
a unitary process is a possibility and that this possibility should be given serious 
consideration before we expend considerable amounts of energy building new 
and distinct models for every new task we invent (see Tarpy, 1975). 

SUMMARY 

1. It is important to consider whether our various learning tasks reflect truly 
different learning processes. If they do, then theories and models governing 
each learning capacity must be developed. If it is determined that some 
common, underlying process accounts for all the various learning tasks, then 
a single model should suffice. 

2. There are various difficulties in trying to decide if the different learning tasks 
represent different kinds of learning. 

3. Paired-associate learning can be thought of as either classical or instrumental 
conditioning. Language learning also has much in common with other forms 
of learning. 

4. A comparison of classical conditioning and instrumental conditioning is
hampered by the fact that classical conditioning seems to possess characteristics,
or elements, of the instrumental paradigm. Similarly, instrumental conditioning
situations may never be free of classical conditioning elements.

5. Attempts to distinguish between classical and instrumental conditioning by 
instrumentally and classically conditioning the same responses have been 
relatively unfruitful. 

6. The notion that instrumental conditioning involves reinforcement whereas 
classical does not is challenged by the fact that potentially reinforcing events 
can be identified in all classical paradigms and that contiguity is not sufficient 
for classical conditioning to occur. 

7. The notion that an animal’s future is affected by its response in instrumental 
conditioning but not in classical conditioning also seems not to be true. 
Classically conditioned responses do appear to affect the animal’s future. 

8. The notion that involuntary responses can only be classically conditioned is 
challenged by biofeedback research, which suggests that so-called involuntary
visceral responses can be instrumentally conditioned. But the field of
biofeedback is not without its own problems. Replication is often difficult. In 
addition, some involuntary responses may be mediated by muscular and 
mental voluntary responses. Biofeedback research, which was predated by 
Eastern practices, may prove fruitful in an applied sense. 

9. Behavioral medicine is a new, growing field, which includes the area of 
biofeedback. 

10. Comparisons of learning tasks at a procedural level yield distinct, but
superficial, differences.

11. Distinctions among the various learning tasks at a neurological level have 
proved difficult to maintain. 

12. Behavioral comparisons of the various learning tasks reveal a wide range of 
similarities. The various tasks respond in similar ways to many sorts of 
manipulation. Similar performance and extinction curves appear in many 
different tasks. Spontaneous recovery, external inhibition, stimulus
generalization, discrimination, and summation also appear in the various tasks.

13. At present, it would seem premature to draw definite conclusions about the 
existence of multiple learning processes. Although there appear to be a good 
number of similarities among a number of learning tasks, the final significance
of these similarities remains to be seen.

THE “INTERCHANGEABLE-PARTS” CONCEPTION OF LEARNING

American psychologists have spent the better part of this century developing what 
can be termed the “interchangeable-parts” conception of learning. According to 
this model, certain laws and generalizations govern the development of associations
between all stimuli and all responses. The nature of the stimuli, the nature of
the responses, and the nature of the animal doing the learning are not considered 
to be particularly important. In this chapter we will examine, and challenge, that 
view. Specifically, we will explore the predispositions that animals have to learn in 
certain ways. 


Stimulus Interchangeability 

According to the interchangeable-parts conception, the specific nature of the 
stimulus is relatively unimportant in determining the course of learning. As long 
as the animal can perceive the cue, it can utilize it. For example, if we are training 
a dog to respond to a signal, it does not make much difference whether the cue is a 
light, a bell, or a mild shock. The notion is that, as long as we reinforce the animal 
when it responds to the stimulus, learning will occur in pretty much the same 
fashion, regardless of the nature of the stimulus. Curiously, this American belief 
in the equivalence of stimuli can be traced back to the Russian physiologist Pavlov 
(1928), who stated that any stimulus that is selected—a visual stimulus, a sound, 
an odor, or a touch—can become a conditioned stimulus. 

Response Interchangeability 

The same thinking has been applied to responses. Suppose that a dog is confined 
within a box and wants to escape. In one condition it must bark to gain release. In 
another condition it must roll over to escape. According to this line of thinking, it 
should not make much difference which of these two responses is required. 
Learning should progress at about the same rate in either case. The responses 
should be interchangeable. 


“Unequal” Stimuli 

Unfortunately, this simple, elegant model does not really fit the facts of nature. A 
growing body of literature suggests that such a conception vastly oversimplifies 
reality. For example, Garcia and Koelling (1966) showed that, when a specific 
flavor, a certain sound, and a particular light are all paired with an aversive 
stimulus, rats will associate the flavor but not the sound or the light with the 
aversive stimulus. In other words, the three stimuli are differentially effective as 
stimuli; they are not interchangeable. Similarly, chemically induced illness is 
easily associated with taste but not so easily associated with odor (Garcia & 
Rusiniak, 1979; Hankins, Rusiniak, & Garcia, 1976). One stimulus “works” 
much better than the other. 


“Unequal” Responses 

Just as some stimuli are more easily involved in a given association than others, 
certain responses seem to be conditioned more readily than others (Morgan & 
Nicholas, 1979). It is a simple matter to teach a rat to press a bar or leap a barrier 
for food. But if you try to teach a rat to scratch for the same reinforcement, you 
will be in for a hard time. The animal just does not seem ready to learn this 
particular response when food is the reinforcement, even though it already knows
how to scratch and presumably learned to do so in order to gain relief from 
itching. In a similar vein, Charlton (1983) has shown that it is very difficult to get 
hamsters to groom for food even though they are readily able to learn other tricks 
for the same reinforcement. 

Interacting Stimuli and Responses 

To complicate things further, it should be noted that certain responses are readily 
associated with particular stimuli but not with others, whereas other responses 
are readily associated with the opposite stimuli. In the rat, for example,
shock-produced fear is easily linked to odor but not so easily to taste. Conversely,
illness is readily associated with the opposite stimulus (taste) and not so easily
with odor (Garcia & Rusiniak, 1979).

We are faced with the notion that each animal brings to every learning task 
certain capabilities and predispositions to act in certain ways. These distinct 
capabilities may well be the result of natural selection as well as of the individual 
animal’s past life experiences (Domjan & Galef, 1983). 

THE CONTINUUM OF PREPAREDNESS 

Seligman (1970) has defined a continuum of preparedness, which may help us 
grasp the significance of these facts. According to Seligman, animals are
differentially prepared to associate certain responses with certain stimuli.
Preparedness can exist in any degree, from completely prepared to completely unprepared.
The relative preparedness of an animal to connect a given response to a given 
stimulus is operationally defined by Seligman in terms of the number of pairings 
of the stimulus and response required for learning to occur. 

This is a somewhat circular definition. On the one hand, preparedness is 
defined in terms of the number of pairings required for learning to occur. Ease of 
learning, on the other hand, is indexed by degree of preparedness. An independent
measure of preparedness would be needed to overcome this circularity. But
the concept of preparedness does have some heuristic value. Animals may be 
extremely prepared to associate certain events, somewhat ready to associate 
others, and totally unprepared to associate still others. Seligman’s hypothesis 
challenges the traditional American interchangeable-parts position. 
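
Seligman's operational definition can be made concrete with a small sketch. The code below is only an illustration, not part of Seligman's work: the function name and the pairing counts are hypothetical, chosen to show how pairings-to-criterion would order associations along the continuum. It also makes the circularity visible, since the ranking is nothing more than a re-description of the same trial counts.

```python
def rank_by_preparedness(pairings_to_criterion):
    """Order associations from most to least prepared.

    pairings_to_criterion maps a (stimulus, response_or_outcome) pair to the
    number of pairings needed before learning is observed. Fewer pairings
    means more prepared, following the operational definition in the text.
    """
    return sorted(pairings_to_criterion, key=pairings_to_criterion.get)


# Hypothetical counts, invented purely for illustration (not data).
example_counts = {
    ("taste", "illness"): 1,
    ("light", "shock"): 5,
    ("noise", "illness"): 50,
}

ranking = rank_by_preparedness(example_counts)
for association in ranking:
    print(association, example_counts[association])
```

Because the only input is the number of pairings, the sketch cannot say why one association is learned faster than another; that would require the independent measure of preparedness mentioned above.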

Preparedness in Instrumental Conditioning 
Although few would argue with the fact that many diverse responses and stimuli 
can be associated in roughly comparable ways, a wide array of data suggests that 
the notion of equivalent associability must be tempered by a consideration of 
genetically and evolutionarily determined factors and predispositions that animals,
including humans, bring to the learning situation.

Let us begin with some examples of unprepared instrumental conditioning. 
Konorski (1967) attempted to reinforce yawning in dogs with food. Konorski 
reports that, whereas other responses are easily trained, it is extremely difficult, if 
not impossible, to train a dog to yawn for food. The animal is unprepared to 
associate these events. Using monkeys, Bolles and Seelbach (1964) reinforced 
various responses with noise offset and punished others with noise onset. Some 
responses, such as exploratory behavior, could be strongly affected by both noise
onset and offset, but grooming behavior was relatively unaffected by either event
(see also Charlton, 1983). 

In exploring prepared instrumental conditioning, Brown and Jenkins (1968) ran
an interesting experiment that represents an instance of extreme preparedness. 
Pigeons were exposed to a lighted key. Grain was presented each time the key was 
lit, regardless of what the bird did. The pigeon was not required to do anything. It 
was free to stand there and receive and eat the grain as it was delivered. But the 
pigeon began to peck the light despite the fact that no response was required for 
the delivery of food. The key-pecking behavior was maintained even though it 
had no effect on food delivery. This unique effect has been called “autoshaping,” 
and we shall discuss it in some detail in Chapter 6. For now, think of it as an 
example of extreme preparedness in an instrumental situation (see also Brandon 
& Bitterman, 1979; Durlach, 1983; Pisacreta, Redwood, & Witt, 1983).

Preparedness in Discrimination Learning 
Discrimination learning involves the presentation of two stimuli, one positive and 
one negative. If the animal responds to the positive stimulus, it is reinforced. If it 
responds to the negative stimulus, it is not reinforced. Several investigators 
(Dobrzecka & Konorski, 1967, 1968; Lawicka, 1964; Szwejkowska, 1967) have 
found dogs quite ready to discriminate among certain types of stimuli in certain 
situations, but not others. For example, they attempted to teach dogs either to go 
and receive food or to stay, based on verbal commands. One voice tone from the 
experimenter was the positive stimulus, whereas a different voice tone from the 
same experimenter was the negative stimulus. The animals mastered this
discrimination easily, going in response to one tone and staying in response to the other.
But when the positive and negative stimuli were changed, quite different results 
were obtained. If a given voice tone came from above the dog, it represented the 
positive stimulus. If the same voice tone, from the same experimenter, came from 
below the dog, it represented the negative stimulus. In this case the dogs had a 
great deal of trouble learning the discrimination. These and other results showed 
that the animals were not prepared at all to use the location of the voice as a cue 
in deciding whether to go or stay. 

Humans Are Prepared—Sometimes 

Are humans prepared to associate certain events and unprepared to associate 
others? Even though much of the “evidence” is speculative and anecdotal, it does 
seem that humans are more ready to learn certain things than they are others. 

Let us begin with some behaviors that seem to be acquired easily, almost 
automatically. Lenneberg (1967) has argued that we are very much prepared to 
learn an important class of materials—language. As we shall see in Chapter 13, 
we seem to learn language without a great deal of elaborate training. Human 
children acquire language with ease, and they seem to do it in all cultures in 
roughly the same manner. Adult humans do not have to set up elaborate training 
procedures to ensure that their children will acquire language. It is accomplished
efficiently. The child seems prepared to acquire language. We utilize language 
almost incessantly and in an almost effortless stream. 

There are other examples of human preparedness. For example, humans learn
to walk with a minimum of effort and training. Walking seems almost automatic
and innate. Some investigators, in fact, classify walking as an innate behavior, or
one that requires no learning at all. Whether walking turns out to be purely innate
is still open to question. The fact that some feral children have been found to walk
on all fours suggests that upright walking is not purely innate. But we can be sure
that upright walking is at the very least an extremely prepared sort of behavior.
As shown in Figure 4.1, even newborn infants will engage in many of the complex
movements associated with walking. Even though they cannot hold themselves
upright, these infants are ready to move in this manner.

Figure 4.1 Newborn infants will, when held erect, engage in many of the complicated
movements associated with walking.

Infants tend to grasp with their hands. This behavior, too, is often described as 
innate. Whatever it is, it is an extremely prepared activity, and one that needs 
little, if any, practice to be established very firmly (see Figure 4.2). 

Figure 4.2 The infant’s hand-grasping behavior is extremely prepared, perhaps requiring
no learning at all. Note that the middle fingers are closed first, and the thumb, last.

Sucking, smiling, laughing, and crying are additional examples of highly prepared
human behaviors. Eibl-Eibesfeldt (1972) reports that all people, regardless
of the culture they live in, momentarily flick their eyebrows upward when they
recognize someone (see Figure 4.3). This eyebrow-flick response is so pervasive
and so widespread that we are led to believe, or at least strongly suspect, that it is
an innate, extremely prepared response.

Figure 4.3 The possibly innate eyebrow flick in, from top to bottom, the Waika Indian,
Balinese, and Papuan cultures. In each case the right-hand expression occurs after
recognition, and the left-hand expression occurs before recognition. (From Love and
Hate by Irenaus Eibl-Eibesfeldt. English translation copyright © 1971, 1972 by Methuen
and Co. Ltd. and Holt, Rinehart and Winston, Inc. Originally published in Germany
under the title Liebe und Hass: Zur Naturgeschichte Elementarer Verhaltensweisen by
R. Piper and Co. Verlag, Munich 1970. © R. Piper & Co. Verlag, Munich 1970.
Reprinted by permission of Holt, Rinehart and Winston, Publishers.)

Eibl-Eibesfeldt’s observations of deaf and blind individuals lend further support 
to the idea that many human behaviors are highly prepared and that their
occurrence is not dependent on extensive training. For example, Figure 4.4 shows 
that a deaf and blind girl has many of the same facial expressions that hearing and 
sighted individuals do even though she has never had the opportunity to learn 
these expressions through the use of hearing and vision. 

Figure 4.4 Possibly innate facial expressions in a deaf and blind female, including
relaxation, smiling, and crying. (From Love and Hate by Irenaus Eibl-Eibesfeldt. English
translation copyright © 1971, 1972 by Methuen and Co. Ltd. and Holt, Rinehart and
Winston, Inc. Originally published in Germany under the title Liebe und Hass: Zur
Naturgeschichte Elementarer Verhaltensweisen by R. Piper and Co. Verlag, Munich
1970. © R. Piper & Co. Verlag, Munich 1970. Reprinted by permission of Holt,
Rinehart and Winston, Publishers.)

Proceeding to the other end of the preparedness continuum, we find negative 
constraints on human behavior, just as on animal behavior. In other words, there 
are some behaviors that seem difficult for humans to learn. Every parent knows 
only too well, for example, how difficult it is to teach children to be polite and to 
brush their teeth. Nursery school teachers are often struck by the difficulties 
involved in establishing “sharing” behavior (McCarthy & Houston, 1980). 
Youngsters learn to run, jump, quarrel, talk, and the like long before they even 
grasp the rudiments of sharing. In fact, many teachers of young children assume 
that, if they can encourage even a “readiness” to share, they are doing well. 
Somehow, sharing is just not one of our most highly prepared behaviors. Even 
when reinforced regularly, it is difficult to establish and maintain. 

As a final example of unprepared human learning, consider biofeedback training.
As you will recall, an example of this might involve learning to increase or
decrease an internal “involuntary” process such as heart rate. This learning
process is gradual, difficult, and sometimes almost impossible. It requires
complicated and exacting experimental training procedures. Humans seem relatively
unprepared to learn to control internal “involuntary” responses. 

TASTE AVERSION: BAIT SHYNESS

Exterminators, or those who poison animals for commercial purposes, are familiar
with the fact that, when an animal eats poisoned bait and survives, it develops
an aversion for the flavor of that particular bait. The flavor becomes in some way 
a danger signal, similar to what is experienced by people who become wary of a 
particular delicacy that has made them ill. 

Psychologists have brought this phenomenon into the laboratory and have 
discovered some interesting things about it. They have shown that animals learn 
quickly to avoid distinctively flavored solutions when ingestion of these solutions
is followed by gastrointestinal distress or some other type of malaise. Although 
this avoidance is not limited to flavors (odors can become aversive as well), 
journals in the field are filled with articles concerned with this so-called
taste-aversion effect (Domjan, 1980; Garcia, Ervin, & Koelling, 1966; Garcia, Ervin,
Yorke, & Koelling, 1967; Garcia & Rusiniak, 1979; Garcia, Rusiniak, & Brett, 
1977; Mitchell, Kirschbaum, & Perry, 1975; Monroe & Barker, 1979; Riley, 
Jacobs, & Mastropaolo, 1983; Rozin, 1967, 1968, 1969). In a typical experiment, 
rats are given saccharin-flavored water and simultaneously exposed to X rays. X 
irradiation makes rats sick after an hour or so. It has been found that after 
drinking and becoming ill the rats subsequently show a distinct aversion for the 
saccharin flavor. They somehow associate the flavor with the illness. The aversion 
can persist through many weeks of preference testing. 

A great deal of work has been done with these kinds of taste aversion. Garcia,
Hankins, and Rusiniak (1974), Garcia and Rusiniak (1979), and Garcia, Rusiniak,
and Brett (1977) point out that similar effects have been obtained with rats,
mice, cats, monkeys, ferrets, coyotes, birds, fish, and reptiles. Flavors have
included sweet, sour, bitter, salty, coffee, fruit juice, and milk. Malaise has been
caused by ingested toxins, injected drugs, X rays, gamma rays, neutron bombardment,
transfusion of blood from irradiated donor animals, and motion sickness.
Garcia, Hankins, and Rusiniak (1974) also report some unpublished data indicating
that humans find foods distasteful that are coincidentally followed by sickness.
Bernstein (1968) reports that children undergoing chemotherapy can develop
a learned taste aversion for ice cream. Furthermore, as we shall see, learned
aversions for particular odors and sights can be established under certain
circumstances.

Aversion for more than just external stimuli can be established. Peters (1983) 
has shown that an aversion to copulatory behavior in male rats can be established 
by pairing such activity with the injection of lithium chloride, a chemical that 
makes the animal ill. And finally, the true complexity of the situation is
underscored by Pelchat, Grill, Rozin, and Jacobs (1983), who point out that the nature
of the aversive response depends on the nature of the discomfort. Suppose that 
two people eat shrimp and that one gets food poisoning, whereas the other 
develops hives. The poisoned individual, on the one hand, may not only avoid 
shrimp but also come to intensely dislike them. The hives sufferer, on the other 
hand, may avoid shrimp but still like the flavor. Thus we can see that bait shyness 
is a widespread and complex phenomenon. Many different stimuli, responses, 
animals, and noxious elements have been utilized. 

Garcia, Hankins, and Rusiniak (1974) summarized some of the general principles
that have emerged from all this research. They report that, other things being
equal, the stronger the flavor, the greater the aversion. In addition, the more severe
the illness, the stronger the aversion. Finally, the strength of the aversion decreases
as the time between ingestion and illness is increased. This is all as we
might expect. Strongly flavored foods followed by immediate, severe illness will 
yield the greatest aversion. 

The Relationship Between Bait Shyness and Avoidance Conditioning 
We can see that the bait-shyness phenomenon possesses many, but perhaps not
all, of the qualities of the avoidance-conditioning paradigm. You will recall that
Mowrer conceived of avoidance conditioning as a two-step process. “Fear,” or
some other emotional response, is first classically conditioned and then
instrumentally avoided. In the bait-shyness situation the flavor can be considered a
CS, and the noxious agent, the UCS. The UCR is the illness caused by the UCS. 
We pair the flavor (CS) with the UCS (noxious agent). Through this pairing the 
CS (flavor) comes to elicit a CR that in some sense approximates the UCR 
(illness). The presentation of the flavor (CS) elicits a “conditioned illness.” The 
animal then avoids, or escapes from, the CS and its associated unpleasantness. In 
a sense, the bait-shyness effect is similar to the rat’s jumping a barrier to escape 
from a distinctive compartment that has been associated with shock. 

But there is at least one important difference between bait shyness and the 
typical avoidance-conditioning situation. In the bait-shyness learning situation 
there is often a long delay (sometimes hours) between presentation of the CS 
(flavor) and occurrence of the illness. Most of the conditioning situations described
to this point involve CS-UCR intervals of no more than a few seconds.
And CS-UCR intervals of more than a few seconds usually result in very poor
conditioning. No one knows for sure why conditioning can be obtained in the 
bait-shyness situation with these long delays. Some feel that bait shyness represents
a different kind of learning altogether. Others feel that some trace of the CS
(flavor) is brought forward in time such that it does occur, or exist, at the time the 
illness occurs. This latter position holds that bait shyness involves ordinary trace 
classical conditioning and does not represent a new or different variety (see also 
Logue, 1979). A detailed consideration of this interesting issue will be presented 
in the next chapter in our discussion of the general issue of stimulus—response 
contiguity (see also Domjan, 1980). 

Preparedness and Bait Shyness 

Garcia and Koelling (1966) ran an experiment in which several things happened 
to rats all at once. While the animals were drinking saccharin-flavored water, they 
were X-irradiated; in addition, lights flashed and a noise sounded at the same 
time. The question was which of the CS elements (flavor, sound, or light) would 
become aversive. Interestingly, only the flavor became aversive. The animals 
showed no aversion at all to the flashing light or sound. The argument is that the 
rats were unprepared to associate lights and noise with illness but were prepared 
to associate flavor with illness. The investigators ran another experiment, in 
which flavor, noise, and flashing lights were paired with foot shock rather than X 
irradiation. In this case the light and noise became aversive, but the flavor did not.
All this seems to fit some kind of logical schema, probably because all of it is
adaptive. Natural selection would favor animals prepared to associate illness with 
taste and not with other external environmental events. It would favor animals 
predisposed to associate foot pain with external events rather than with what they 
had eaten. The reverse is ridiculous and maladaptive from an evolutionary 
standpoint. Animals predisposed to avoid the flavors that they experienced when 
their feet hurt probably would not last very long. 

The ability to learn these aversion responses seems to be at least partially 
determined by genetic mechanisms. This idea is supported by the fact that strains
of rats can be selectively bred for strong and weak acquisition of bait-shyness 
responses (Hobbs & Elkins, 1983; Maggio & Harder, 1983). 

This kind of experimentation has led to the generalization that some animals, 
such as the rat, are phylogenetically prepared to associate internal events (such as 
illness) with internal cues (such as taste sensations) but not with external cues 
(such as noises). All this is part and parcel of the notion that biology places 
complex constraints on learning. What we have learned in this section is not only 
that different stimuli and responses are unequally learnable but also that different 
sensory systems in different species are unequally capable of handling and encoding
different forms of information (see Best, Best, & Mickley, 1973; Garcia &
Rusiniak, 1979). 

Novelty and Taste Aversion: Preexposure Effects 
Suppose that a rat drinks a flavored solution and is injected with lithium chloride. 
Then, an hour later, the rat drinks another, differently flavored solution. Finally, 2 
hours after the initial injection, the animal becomes ill. To which of the two 
flavors will the aversion be developed? Even though the lithium chloride was 
injected when the animal drank the first solution, what will keep the animal from 
associating the illness with the more recent flavor? Although this is a complicated 
situation, it appears that the more novel, or new, a flavor, the more likely it is to
be picked up by the animal as the culprit. If the animal has never tasted the first 
flavor, whereas the second flavor is familiar, then that first flavor tends to become 
aversive. But if the second flavor is more novel than the first, then the animal may 
avoid it rather than the first flavor, which was, in fact, the one that occurred 
during lithium chloride poisoning. The animal could “make an error” here by 
avoiding the wrong flavor. If you went to a party and were poisoned by turkey 
just before you ate some harmless snake meat, the next day you might suspect the 
snake meat because it was novel. 

Dawley (1979) presents some evidence that supports this novelty interpretation.
He found that, when rats were repeatedly given a particular flavor before the
flavor was paired with lithium chloride, an aversion toward that flavor developed 
slowly. In other words, preexposure to the flavor reduced its novelty value. 
Similar results have been reported by Riley, Jacobs, and Mastropaolo (1983). The 
importance of novelty in aversion learning has been extended by Kurz and 
Levitsky (1983), who have shown that rats will avoid a novel place as well as a 
novel taste when they are injected with lithium chloride. If you get sick after 
eating a meal on your first submarine ride, you may in the future avoid submarines
as well as the type of food you ate.

Bait Shyness and the Control of Predation
Aside from its theoretical implications, bait shyness has some practical relevance.
As is widely known, there is a continuing conflict between conservationists and
stockmen over the coyote and its undeniable tendency to eat little lambs. Naturalists
argue that the coyote, which occupies an important niche in the ecology of
our lands, should be preserved. Stockmen think of the coyote as a pest and a killer 
that should be eliminated. The stockmen’s view is that the coyote must go so that 
the sheep may remain. As a result coyotes are hunted and poisoned in enormous 
numbers. 

The bait-shyness phenomenon may offer a solution that will allow both the 
prey and the predator to survive. Gustavson, Garcia, Hankins, and Rusiniak 
(1974) tested the effectiveness of predation control through bait shyness under 
quite natural conditions. They fed coyotes lamb flesh dosed with lithium chloride, 
which causes the coyote to be extremely ill. After several such meals and subsequent
illnesses the coyotes were offered the opportunity to attack live lambs and
live rabbits. The coyotes refused to attack the lambs. The coyotes, in fact, ran 
from them. Their predatory habit had been effectively curbed. But they were more 
than willing to attack rabbits after having eaten treated lamb flesh. The aversion 
was specific to lamb flesh. Further experimentation indicated that the consumption
of treated rabbit flesh would inhibit attacks on rabbits but not on lambs.

Garcia, Hankins, and Rusiniak (1974) note that this kind of research could 
easily lead to reasonable predation-control programs. Bait that smells and tastes 
like lamb and has been treated with lithium chloride could be scattered about the 
countryside. Coyotes and wolves would consume the bait, experience the illness, 
and refrain from attacking lambs. They would restrict their attacks to rabbits and 
other natural prey, thereby protecting both ranchers’ stock and the balance of the 
ecosystem. This kind of predation-control program seems especially promising 
when one considers that it may well be applicable to many different types of prey 
and predator. 


Bait Shyness and Aversion Therapy 

As we learned in Chapter 2, there is a treatment used with humans that resembles 
the bait-shyness effect (see Fantino, 1973). In aversion therapy humans are 
stimulated with some strong noxious event while they are engaging in the behavior 
to be controlled (smoking, drinking, eating). The most popular kinds of 
noxious event are electric shock and drugs that induce illness (Sherman, 1973). 
The “bad habit,” or the object of the habit, when paired with a noxious event, 
becomes aversive in much the same way that the flavor of a particular food 
becomes aversive in the bait-shyness effect. Wolpe and Lazarus (1966) describe 
typical procedures using electric shock. Electrodes are attached to the subjects’ 
arms. The level of shock employed as a noxious stimulus is often just beyond the 
point at which the subjects describe it as being distinctly unpleasant. The shock is 
presented either while they are actually engaged in the unwanted behavior or 
when they are imagining that they are engaged in it. 

The technique in which the subject imagines rather than engages in the behavior 
appears particularly useful when the target behavior cannot easily, or 
practically, be brought into the treatment situation. For example, Lazarus (1960) 
reports a case in which a 10-year-old boy had developed the habit of waking in 
the middle of the night and going to his mother’s bed. Needless to say, this 
behavior, which occurred regularly each night, began to put some strain on the 
family relationships. No amount of parental bribery, punishment, or reward 
seemed to help. Aversion therapy finally terminated the behavior. Electrodes were 
attached to the boy’s arm, and he was asked to imagine that he was in his 
mother’s bed. When he felt that he had a good strong image of the requested 
situation, he was to say “mother’s bed.” At that moment the experimenter turned 
on the current. When the patient could no longer tolerate the shock (typically 
after a few seconds) he was to say “my bed.” At that point the shock was 
terminated. The treatment appears to have been very successful. The child regularly 
slept in his own bed thereafter. The point is that the child did not actually 
have to be shocked in his mother’s bed. An image of the situation was sufficient. 

It should be noted, however, that aversive control is not always so successful. 
For example, it has been only mildly successful with alcoholism. In other words, 
although aversion therapy can be helpful in many situations, it is far from a 
“miracle” technique. 

SPECIES-SPECIFIC DEFENSE REACTIONS 
This chapter has been speaking of a continuum of preparedness but has said little 
about why certain events are associated more easily than others. It has been noted 
very generally that the pressures of natural selection probably contribute to the 
existence of the continuum. But Bolles (1970) has gone further in attempting to 
explain at least some differences in preparedness. He has presented a very persuasive 
argument concerning the nature of avoidance conditioning. The problem he 
addressed is that some avoidance responses are easily learned by certain species, 
whereas others are not. For instance, rats will learn to jump or run to avoid shock 
(Baum, 1969; Miller, 1951), but they have a difficult time learning to press levers 
to avoid shock (D’Amato & Fazzaro, 1966; Fantino, Sharp, & Cole, 1966). The 
same is true of pigeons. They will learn to fly away to avoid something unpleasant 
(Bedford & Anger, 1968), but it is difficult to train them to peck a key to avoid 
negative stimuli (Azrin, 1959; Rachlin & Hineline, 1967). Why do these differences 
exist? Bolles (1970) argues that animals have species-specific defense reactions 
(SSDRs), such as running, flying, freezing, or fighting. These are innate, 
automatic behaviors that occur in response to any novel or sudden stimulus event. 
He argues that these innate defense reactions are those that are established easily 
in avoidance-conditioning experiments. Arbitrary responses, such as the bar 
press, which have been selected for the psychologist’s convenience, are not acquired 
easily. Bolles maintains that, if a particular avoidance response is acquired 
easily, then it must necessarily be an SSDR. 

He illustrates his case with a little fable. A furry little animal ran about the 
forest eating and copulating as much as possible. One day it was attacked by a 
large predator. Luckily, it escaped. It was hurt and frightened, but nothing more. 
Some time later, while running through the forest again, it perceived a conditioned 
stimulus. That is, it heard, saw, or smelled something that had preceded 
the first attack. It was frightened by this sign, or cue, and ran for safety. From that 
day on the little animal successfully avoided attack by the bad predator. The fable 
fits the ordinary conception of avoidance learning. Based on Mowrer’s two-factor 
theory, one would argue that fear had been classically conditioned to the cues 
preceding the first attack. Later these cues produced fear, which was instrumentally 
avoided. Fear reduction reinforced the escape response. 

Bolles argues that this reasoning is “utter nonsense.” First, predators do not 
present cues just before they attack. Owls do not hoot 2.5 seconds before they hit 
their prey. Coyotes do not consistently give recognizable signs before they 
pounce. The whole idea, we must assume, is for predators to minimize such 
telltale signs. They might occasionally slip up, but Bolles points out that predators 
do not allow their prey enough trials for ordinary avoidance conditioning to 
occur. Mice do not generally escape from owls dozens of times. Bolles suggests 
that the little animals stay alive not because of laboratory-type avoidance conditioning 
but because of innate SSDRs. Whenever the animal is faced with novel 
or sudden stimuli, it flies, freezes, or runs innately and automatically. It has not 
learned these responses through narrow escapes. They are automatic responses to 
novel or sudden stimulus situations. As Bolles points out, gazelles do not run from 
lions because they have been bitten by lions. They automatically run from any 
large object that approaches them (see Blanchard, Mast, & Blanchard, 1975). 
Prey will react with an SSDR to the sudden presentation of a harmless stimulus as 
well as to the sudden appearance of a truly dangerous predator. 

One can see how Bolles’s hypothesis might account for our laboratory data. 
Rats may well come into our experiments neurally wired, or prepared, to run in 
response to danger or any other sudden stimulus change. Bar pressing, however, is 
not one of their SSDRs and is therefore difficult to establish. 

Obviously, Bolles’s hypothesis does not answer all our questions (see Crawford, 
Masterson, & Wilson, 1977). It is an interpretation of the limited avoidance-conditioning 
situation and does not account for other types of preparedness. But 
it is provocative and does suggest that we must consider the innate survival 
mechanisms that the animal brings to our mechanized, arbitrary, laboratory 
learning situations. 


INSTINCTIVE DRIFT 

Instinctive drift is an interesting concept developed by Breland and Breland (1961, 
1966). It is relevant to our consideration of preparedness, and an example or two 
should clarify its meaning. The Brelands attempted to train a raccoon to pick up 
coins and place them in a 5-inch metal box. Raccoons have the reputation of 
being tractable, eager, and quite intelligent. Hence, the Brelands expected little 
difficulty in establishing this particular response sequence. At first things proceeded 
fairly well. Each time the raccoon dropped the coin in the box, it was 
reinforced with food. But after a while, instead of promptly dropping the coins in 
the box, the raccoon spent seconds, and sometimes minutes, rubbing the coins 
together and dipping them in the box. All it had to do was let go to be reinforced. 
But it seemed to have difficulty in letting go at all. As time went on, the rubbing 
and dipping behavior became still stronger even though it resulted in fewer and 
fewer reinforcements. It seemed as though the raccoon’s innate “washing behavior” 
had crept into the conditioning situation. Parenthetically, the Brelands 
suggest that the so-called washing behavior may help raccoons break away the 
exoskeleton of their natural prey, the crayfish. 

In another situation Breland and Breland attempted to train pigs to pick up 
large wooden coins and place them in a bank. At first the pigs eagerly performed 
the correct responses, picking up, carrying, and depositing the coins without 
hesitation. But over a period of weeks the reinforced behavior deteriorated. The 
pigs began to drop the coins, root them, pick them up, drop them again, and root 
them again. In some instances the pigs finally required 10 minutes to carry four 
coins a distance of 6 feet. The pigs’ natural food-gathering responses appeared to 
intrude into the learning situation. These “disruptive” behaviors seemed to 
violate what is typically thought to be true of learning situations. That is, they 
used more energy than the simple, required responses. In addition, they resulted 
in fewer reinforcements. 

In attempting to account for this type of “misbehavior” Breland and Breland 
(1961) write: 

It seems obvious that these animals are trapped by strongly instinctive 
behaviors, and clearly we have here a demonstration of the prepotency of 
such behavior patterns over those which have been conditioned. 

We have termed this phenomenon “instinctive drift.” The general principle 
seems to be that whenever an animal has strong instinctive behaviors in the 
area of the conditioned response, after continued running the organism will 
drift toward the instinctive behavior to the detriment of the conditioned 
behavior and even to the delay or preclusion of the reinforcement. In a very 
boiled-down, simplified form, it might be stated as “learned behavior drifts 
toward instinctive behavior.” (p. 69) 

The Brelands conclude that all this is not to disparage the use of learning 
principles. It merely suggests that the assumptions underlying what we have 
termed the interchangeable-parts conception of learning must be tempered by a 
consideration of the animal’s instinctive apparatus. The Brelands assert that the 
animal does not come to the laboratory as a tabula rasa, that species differences 
are important, and that all responses are not equally conditionable to all stimuli. 

ETHOLOGY: INNATE BEHAVIOR 

Concepts such as SSDRs and instinctive drift underscore the fact that animals are 
extremely well prepared to display certain behaviors. It seems as though certain 
response systems are built into the organism. These responses are not dependent 
on learning; they will occur automatically when the appropriate environmental 
stimuli are perceived. A gazelle does not have to learn to run when it sees a lion; 
the running response occurs automatically when the lion is seen. Such released 
responses are called innate responses; they represent the most prepared of all 
behaviors. 

Most psychologists would agree that some behaviors are clearly innate, some 
learned, and some probably represent a complex mixture of learned and innate 
components (see Haig, Rawlins, Olton, Mead, & Taylor, 1983). Exactly how 
innate and learned behavior interact and affect each other remains an intriguing 
question. 

It is within ethology, a subdivision of biology concerned with behavior in 
relation to the environment, that many of these issues about innate behavior are 
addressed. Hence, we shall spend a little time learning about this field. Ethologists, 
represented by illustrious Europeans such as Konrad Lorenz, Nikolaas 
Tinbergen, and Karl von Frisch, believe that a true understanding of behavior 
must include a consideration of innate factors as well as the role of learning. They 
reject the interchangeable-parts conception of behavior; to the contrary, they 
believe that much of behavior is predetermined and precisely defined by genetic 
and biological factors. 


Six Concepts 

REACTION-SPECIFIC ENERGY. Tinbergen (1952) has observed that the male stickleback 
fish will attack and drive off other males intruding into its territory but will 
not attack females. We can use this interesting bit of behavior to introduce the 
main concepts developed by the ethologists to account for innate behavior. The 
term reaction-specific energy refers to the idea that there is a pool, or source, of 
energy that exists only for the occurrence of a particular behavior (for example, 
the stickleback’s attack behavior). 

INNATE RELEASING MECHANISM. But we know that the stickleback doesn’t 
attack constantly. It attacks only under certain conditions. To account for this 
fact the ethologists speak of an innate releasing mechanism. Think of this factor 
as a plug in a basin. It stops up and contains the energy that would otherwise flow 
and would be expressed as attack behavior. 

SIGN STIMULUS. A sign stimulus is the stimulus that, through being perceived by 
the organism, pulls the plug and releases the reaction-specific energy. Sign stimuli 
tend to be fairly simple, but they have an enormous impact on the animal. For 
example, the sign stimulus that releases stickleback attack behavior is the red 
belly of the intruding male. Nothing but that red belly will do the job. Perfect 
replicas of male intruders without red bellies are ignored. But very unrealistic 
replicas with red bellies are attacked fiercely. Sign stimuli are assumed to innately 
open the flood gates without being dependent on learning. 

FIXED ACTION PATTERN. The rigid, stereotyped attack behavior released by the 
sign stimulus is called a fixed action pattern. When the stickleback sees a red 
belly, it attacks and drives the intruder away regardless of what else is going on in 
the environment. 

VACUUM REACTIONS. Sometimes, when the male has not aggressed for a long 
period (because it has not seen a red belly), it will attack anyway. It may attack a 
stimulus that only remotely resembles the normal sign stimulus (for example, a 
pink belly). In fact, given a long enough time without the appearance of a sign 
stimulus the fish may attack just about anything. This behavior, called a vacuum 
reaction, is explained by the idea that reaction-specific energy builds up to the 
point at which it blows out the cork by itself without the cork’s having been 
removed by a sign stimulus. 

SUPERNORMAL STIMULI. Given a choice between its own egg and a larger egg, 
some birds, such as the oyster catcher, will incubate the larger egg. Given a choice 
between its own brown spotted egg and a black spotted egg, some birds, such as 
plovers, will choose the supernormal stimulus. In other words, these behaviors 
seem to be strictly controlled by the nature of the sign stimuli. 

Figure 4.5 The hydraulic model of instinctive behavior. (Adapted from Lorenz, K. The 
comparative method of studying innate behavior patterns. Symposia of the Society of 
Experimental Biology. Cambridge, England: Cambridge University Press, 1950.) 

The Hydraulic Model 

Tinbergen (1951) has proposed a hydraulic model of innate behavior, depicted in 
Figure 4.5. Reaction-specific energy builds up like water collecting behind a dam. 
The innate releasing mechanism holds the water in. A sign stimulus, depicted by 
the weight in the pan, will pull the stopper and release the accumulated energy. 
Thus, there are two factors that can release the energy. First, a sign stimulus can 
pull the plug. Second, the building pressure within the reservoir can lead to a 
“blowout” (and thus the occurrence of a vacuum reaction). Once released, the 
energy or water flows into a sloped container. The more water or energy, the more 
different holes in the pan it will flow through. Tinbergen used this sloped-pan 
mechanism to account for the fact that released behavior sometimes seems more 
complete than it does at other times. The stickleback is assumed to attack more 
fully and completely when the energy flows through more holes in the pan; the 
overall attack response is more complete. 

The most complete response will occur (1) when the sign stimulus is most 
appropriate, thereby exerting a greater pull on the plug, and (2) when a great deal 
of energy has accumulated, thereby exerting a greater push on the plug and a 
greater flow into the sloped pan. 
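
The logic of the hydraulic model can be made concrete with a small simulation. 
The Python sketch below is only an illustration and is not part of the original 
model. The class name, the constant inflow, the release threshold, and the rule 
that completeness grows with the combined push and pull on the plug are all 
assumptions chosen for simplicity. 

```python
from dataclasses import dataclass


@dataclass
class HydraulicModel:
    """Toy version of the hydraulic model; every parameter value is hypothetical."""

    energy: float = 0.0              # reaction-specific energy stored in the reservoir
    inflow: float = 1.0              # energy added per time step (assumed constant)
    release_threshold: float = 10.0  # total force needed to move the plug
    force_per_hole: float = 4.0      # force needed to reach each further hole in the pan
    n_holes: int = 5                 # number of holes in the sloped pan

    def step(self, sign_stimulus: float = 0.0) -> int:
        """Advance one time step and return how many holes the flow reaches.

        0 means no behavior was released. Larger values stand for a more
        complete response.
        """
        self.energy += self.inflow
        push = self.energy        # pressure of accumulated energy on the plug
        pull = sign_stimulus      # weight of the sign stimulus in the pan
        total = push + pull
        if total < self.release_threshold:
            return 0              # plug holds; energy keeps accumulating
        self.energy = 0.0         # reservoir drains once the plug gives way
        return min(self.n_holes, int(total // self.force_per_hole))
```

With these assumed values, a model that never receives a sign stimulus still 
responds on the tenth step, which corresponds to a vacuum reaction. A weak 
stimulus of strength 3.0 presented on every step releases the behavior earlier, 
on the seventh step, and a strong stimulus acting on a nearly full reservoir 
produces the most complete response. 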

Although ethology is quite popular, its methods and concepts have not escaped 
criticism. Ethologists have not been able to disentangle learned and innate components 
of behavior. But they have been able to demonstrate and clarify for us the 
fact that innate influences do, in fact, exist and that they probably interact with 
modifications due to learning. Lorenz (1969) uses the example of the jackdaw to 
emphasize his belief that innate and learned elements interact to produce behavior. 
The jackdaw knows, innately, that it must collect twigs and push them 
together to form a nest. But it doesn’t know, innately, which twigs are most 
suitable. This it must learn through a process of trial and error. Thus the 
ethologists, while not having all the answers, consistently show that any full and 
final understanding of behavior will have to include a consideration of innate 
factors. 


IMPRINTING 

We now turn to imprinting, which is important for two reasons. First, it is a 
substantial and intriguing area of investigation on its own. Second, it exemplifies 
the current problem we all have in trying to disentangle learned and innate 
elements in behavior. 

In its simplest form imprinting includes the fact that a newly hatched bird will 
approach, and form a social attachment to, the first moving object it encounters, 
whether it be the real parent or some parent surrogate. The longer the bird is 
exposed to the target object, the stronger the filial tendency becomes (Bateson, 
1966; Sluckin, 1965). Lorenz (1937) noticed that, if the first object a gosling 
encounters is a human, the gosling will approach that human. In the future the 
gosling will approach the human in preference to its real mother. After the egg 
cracks and the bird struggles out, it straightens up, dries out, takes a little time to 
get organized, and looks around. It approaches the first moving object it encounters 
and becomes imprinted on it. 

Many different kinds of stimuli will serve as a “mother.” Objects that have been 
successfully used as parent surrogates include balloons and electric trains (Fabricius, 
1951, 1955; Fabricius & Boyd, 1954), footballs (Ramsay, 1951), moving 
lines (Smith & Hoyes, 1961), animals of a different species (Baer & Gray, 1960), 
and colored boxes (Salzen & Sluckin, 1959), to mention a few. Most of the 
experimental work has been done with various birds, but some authors argue that 
imprinting occurs to one degree or another in animals such as dogs, primates, and 
even humans (Hoffman & Ratner, 1973). We shall return to the question of 
imprinting in humans in a later section. 

We can see how this phenomenon might be adaptive. Those birds that wander 
off rather than staying close to their parent would probably be less likely to 
survive. But is imprinting learned, or is it innate and unlearned behavior? Does it 
represent some combination of learned and innate behaviors? No one quite 
knows, but these are the questions the psychologist and ethologist address. 

Sample Apparatus and Procedures 

Hess (1958, 1959, 1964, 1972) has done a good deal of influential experimental 
work on imprinting. A description of his procedures should provide a good 
example of the kind of experimental rigor obtainable in connection with the 
imprinting process. Hess’s (1959) apparatus is depicted in Figure 4.6. It consists of 
a circular walkway enclosed with Plexiglas walls. The decoy, or object on which 
the duckling will be imprinted, is a model of a male mallard duck equipped with a 
loudspeaker, which will emit a human voice saying “gock, gock, gock, gock.” The 
decoy also contains a heating unit. Mallard ducklings are hatched in incubators 
and immediately placed in small cardboard boxes until they are used in the 
experiments. The notion here is that they will not become imprinted on anything 
in their dark little compartments. In the imprinting procedure the ducklings are 
released into the apparatus by remote control. The decoy begins to move and emit 
the “gock” sound. The little duck follows, round and round, presumably being 
imprinted on the decoy. When the experimenter wishes to terminate the imprinting 
procedure, the duckling is tumbled through a little trap door. 

Figure 4.6 The imprinting apparatus. (Adapted from Hess, E. H. Imprinting. Science, 
1959, 130, 133-141. Fig. 1, p. 134. Copyright 1959 by the American Association for the 
Advancement of Science.) 

As a test of the strength of the imprinting the experimenter then places two 
decoys in the apparatus. One is the male decoy the duckling was imprinted on; the other 
might be a model of a female mallard emitting the recorded sounds of a real 
female duck. The duckling is reintroduced into the apparatus, and the experimenter 
observes which decoy attracts it. During these kinds of tests most ducklings 
respond to the male decoy, thereby demonstrating the imprinting effect. 
Small blocks, or barriers, placed between the ducklings and the male model do 
not deter the little ducklings. They clamber over these obstacles to be near their 
“mother.” 

Experimenters can vary any number of factors in this apparatus. For instance, 
they can vary the nature of the decoy and the sound it emits. They can vary the 
time spent in the apparatus and the speed of the decoy. In one set of experiments 
Hess wanted to know if it was the time the duckling spent with the decoy or the 
distance it traveled following the decoy that was important in determining the 
imprinting effect. Accordingly, in one condition Hess kept the time spent in the 
apparatus constant while varying the speed of the decoy. This enabled him to vary 
the distance traveled by the ducks in a given period. The results of this experiment 
are shown in Figure 4.7. Imprinting clearly increased in strength as the distance 
the duckling followed the decoy increased. In a companion experiment Hess 
varied the time the duckling spent in the presence of the decoy while keeping the 
distance traveled constant. In this case imprinting did not vary as a function of 
time. Thus, Hess concluded that the distance traveled is more important than the 
time of exposure in determining the strength of the imprinting effect. 

Figure 4.7 Strength of imprinting as a function of the distance traveled, with exposure 
time held constant. (Adapted from Hess, E. H. Imprinting. Science, 1959, 130, 133-141. 
Fig. 6, p. 137. Copyright 1959 by the American Association for the Advancement of 
Science.) 

Natural Versus Laboratory Imprinting 
More recently, Hess (1973) has concluded that, to fully understand imprinting, 
we must study it as it occurs in natural settings, as well as in the laboratory. He 
feels that there are important differences between imprinting in the laboratory 
and in nature. He points out, for example, that imprinting occurring in natural 
settings seems much less reversible than imprinting obtained in the laboratory. 
Furthermore, he argues that the whole imprinting experience, from the little bird’s 
viewpoint, must be very different in the natural and laboratory settings. In the 
normal course of a laboratory study, eggs hatched in an incubator do not all hatch 
at once. Eggs hatched by the mother in the field all hatch within a few hours of 
one another. Although this seems a small difference to us, it may represent an 
enormous difference to a tiny duckling peeking out of its eggshell. 

Because he feels it is important to study imprinting under as natural conditions 
as possible, Hess (1973) has developed a new apparatus that combines, or 
integrates, natural qualities with laboratory control. Though not completely 
natural, this system contains many elements missing in most laboratory experiments, 
as shown in Figure 4.8. Specifically, even though the ducklings are hatched 
in an incubator in this apparatus, they are subjected to the clucking sounds made 
by a female mallard as it sits out in the field. The unhatched ducklings can not 
only hear the female but can also return vocalizations to it, all the while 
enclosed in the shell. As we shall soon learn, these prehatching vocal interactions 
are critical in the overall imprinting process. 

Figure 4.8 Hess’s apparatus used to measure and control vocal interactions between 
unhatched ducklings in laboratory incubator and female mallard on nest in field. 
(Adapted from Hess, E. H. “Imprinting” in a natural laboratory. Scientific American, 
1972, 227, 24-31. Copyright © 1972 by Scientific American, Inc. All rights reserved.) 

Nonvisual Imprinting 

Traditionally, imprinting based on vision has been the subject of most research. 
The emphasis has been on birds hatching and looking around. But it now turns 
out that imprinting may involve other sensory channels as well. Sounds, for 
example, may be as important as sight in the overall imprinting event. Hess 
(1973) believes that vocal interaction between the female duck and its unhatched 
ducklings is part of, and strengthens, the overall imprinting event. In a sense, 
imprinting has already begun before the eggs are hatched. The female’s vocalization 
appears to increase as hatching approaches. The discovery of this form of 
interaction is important because it suggests that imprinting is more complicated 
than was first thought. Specifically, it suggests that imprinting involves multiple 
sensory channels. That is, imprinting involves sound as well as sight, and other 
forms of sensory stimulation as well. 

Imprinting can occur on the basis of scent. Fantino and Logan (1979) point out 
that goats and European shrews will imprint on odor. Baby shrews, for example, 
will imprint on the first piece of cloth soaked in a distinctive odor that is presented 
to them. (Shrews, by the way, are small mouselike mammals with a voracious 
appetite.) Thus, imprinting appears to be a complex event (see also Brown & 
Hamilton, 1977). We now turn to a detailed consideration of whether imprinting 
is innate, learned, or some combination of the two. 

Imprinted Behavior as Innate Behavior 
ETHOLOGICAL INTERPRETATIONS. It is not difficult to grasp the primary controversy 
surrounding the nature of imprinting. Although ethologists sometimes 
disagree (see Hess, 1970; Thorpe, 1956), they have argued that imprinted behavior 
is genetically determined. That is, when the appropriate moving stimulus 
is perceived, the young bird automatically and in a predetermined fashion 
approaches and follows that object. It seems as if the behavior pattern is already 
there, waiting to be released by the appropriate stimulus. 

As we have seen, ethologists speak of sign stimuli and fixed action patterns. 
Sign stimuli are a certain limited range of critical stimuli that, when perceived, 
automatically release a fixed action pattern (for example, approach and follow). 
Imprinting, according to this view, is a built-in mechanism. Whenever one of the 
critical sign stimuli is perceived, the fixed action pattern is triggered, or unblocked. 
In this interpretation there is no learning. Practice is unnecessary, and 
reinforcement is unnecessary. 

Rajecki (1973) has argued against this interpretation on two grounds. First, he 
challenges the concept of a limited range of sign stimuli. According to the 
ethologists themselves, not all stimuli are releasers. Only a certain set of stimuli is 
supposed to be effective in unblocking the fixed action pattern. But Rajecki points 
out that an enormous number of stimuli have been found to be effective sign 
stimuli, for example, blinking lights (James, 1959, 1960) and live hawks (Melvin, 
Cloar, & Massingill, 1967). It may be that all of these varied stimuli are 
included in some range, but the range is so wide and so diverse that the concept of 
a limited set of releasers is of little value. It does not help us understand what is 
going on. Second, Rajecki (1973) questions the notion that the fixed action 
pattern is really all that fixed. It seems that the ethologists wish to conclude that 
the pattern consists of a limited set of prescribed behaviors (for example, 
approach and follow and nothing more). But Rajecki notes a case in which a 
domestic chick not only became imprinted on a swan goose but copied its feeding 
habits and some of its calls as well. The fixed action pattern seems to be less fixed 
than the ethologists would have us believe. 

Thus, although the sign stimulus—fixed action pattern sequence is appealing 
and does account for a good deal of the data, it seems that there is more to 
imprinting than this simple concept would imply. The concept of a limited set of 
sign stimuli and fixed behaviors cannot easily account for the variability in both 
imprinting stimuli and imprinted responses. 

THE CRITICAL PERIOD. Because the sign stimulus-fixed action pattern account is unsatisfactory, 
we need to find something else that would suggest that the imprinting 
process is different from ordinary learning. Very early observations (Heinroth, 
1911; James, 1890; Lorenz, 1935, 1937; Spalding, 1873) led to the conclusion 
that birds will form social attachments with moving objects but that such attachments 
will be formed only during a very brief period soon after hatching. It seems 
that it takes a little time before imprinting can occur, presumably because the very 
young bird is still a little shaky and disorganized. Then there ensues a brief period 
during which imprinting readily occurs. Following this “critical period” there 
seems to be a lessening of the tendency to imprint and an increase in the tendency 
to flee from any novel stimulus. 

These early field observations have been substantiated in the laboratory. Hess 
(1959) and Hess and Schaefer (1959) varied the ages of ducklings to be imprinted. 
That is, the ducklings were kept safely in their cardboard containers for varying 
amounts of time before they were exposed to the male mallard decoy. The results, 
depicted in Figure 4.9, indicate that 13 to 16 hours after hatching seems to be the best 
age at which to obtain the imprinting phenomenon. The point Hess wishes to make is that 
ordinary learning is not restricted to a critical period. It can occur more or less 
readily at any age level. Hess (1964) argues that the critical period is one of several 
characteristics that set imprinting apart from most learned behavior (see also 
Gallagher, 1977). 

Figure 4.9 Imprinting as a function of age. (Adapted from Hess, E. H. Imprinting. 
Science, 1959, 130, 133-141. Fig. 2, p. 135. Copyright 1959 by the American 
Association for the Advancement of Science.) 

Unfortunately, the notion of a critical period does not set imprinting apart from 
ordinary learning. Hess himself (Hess, 1959) sets the stage for a rejection of the 
notion. He argues that the rise and fall in the effectiveness of the imprinting 
procedures result from two interacting factors. First, the initial rise in imprinting 
may stem from the duck’s growing locomotor capacity after hatching. That is, the 
very young duck may want to imprint, but its little legs are so wobbly that it 
cannot get up on them and get the job done. Second, the decrease in imprinting 
after age 13-16 hours seems to result from the duck’s growing capacity to 
experience fear and its resulting unwillingness to follow novel stimuli. Thus, the 
duck may possess the capacity to imprint beyond the critical period, but it does 
not normally do so because it has gradually developed the capacity to fear novel 
objects and does not follow them after a certain age. 
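
The two-factor account just described can be made concrete with a small numerical sketch. It is only an illustration: the logistic curves, their midpoints (8 and 22 hours), and their slopes are hypothetical values chosen by hand so that the peak lands in the reported 13-16 hour range. They are not fitted to Hess's data, and the multiplicative combination of the two factors is likewise an assumption.

```python
import math


def logistic(x: float) -> float:
    """Standard S-shaped curve rising from 0 to 1."""
    return 1.0 / (1.0 + math.exp(-x))


def locomotor_capacity(age_hours: float) -> float:
    # Hypothetical: the duckling's ability to walk and follow grows after hatching.
    return logistic((age_hours - 8.0) / 2.0)


def fear_of_novelty(age_hours: float) -> float:
    # Hypothetical: fear of novel objects develops somewhat later.
    return logistic((age_hours - 22.0) / 3.0)


def imprinting_tendency(age_hours: float) -> float:
    # Assumption: following requires both the ability to follow
    # and the absence of fear, so the two factors are multiplied.
    return locomotor_capacity(age_hours) * (1.0 - fear_of_novelty(age_hours))


# Scan whole-hour ages from 1 to 32 and find where the tendency peaks.
ages = range(1, 33)
best_age = max(ages, key=imprinting_tendency)

for age in (4, best_age, 30):
    print(f"age {age:2d} h: tendency {imprinting_tendency(age):.2f}")
```

With these hand-picked values, one rising curve and one falling curve are enough to produce a rise and fall in imprinting. No special "critical period" mechanism is needed in the sketch.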

Several experiments suggest that fear does, in fact, block imprinting beyond the 
critical period (Bateson, 1964, 1969; Hoffman, Ratner, & Eisener, 1972; Sluckin 
& Salzen, 1961). Basically, all these studies exposed the bird to the harmless but 
fear-producing novel stimulus for a prolonged period of time. They did not allow 
the bird to flee. As time went on, the bird discovered that the stimulus would do 
no harm. Once fear was reduced, these older birds began to form social attachments
to the stimuli. In a nutshell, imprinting may not be limited to a critical
period. It may merely be blocked by a normal pattern of developing fear reactions.
If the fear is removed, imprinting occurs beyond the so-called critical
period.

OTHER DIFFERENCES? Hess and his associates (Hess, 1964; Kovach & Hess,
1963) have suggested that there are a number of other important differences 
between imprinting and ordinary learning. Unfortunately, none of these potential 
differences seems to have been established to everyone’s satisfaction. 

For example, Kovach and Hess (1963) have argued that in ordinary learning
situations noxious stimulation presented in the presence of a particular object will
lead to avoidance of that object. But in their experiments, noxious stimulation
(shock) applied to birds in an imprinting situation led to an increase in approach
rather than avoidance behavior. However, other more carefully controlled experiments
have failed to support this distinction between imprinting and learning
(Barrett, Hoffman, Stratton, & Newby, 1971; Hoffman & Ratner, 1973). Noxious
stimulation can lead to avoidance during imprinting, just as it does in
ordinary learning situations. 

Hess (1973) has also argued that imprinting differs from ordinary learning in 
that it is more long lasting and more irreversible. But these proposed distinctions 
have fallen on hard times, too. Life is filled with examples of long-lasting learning. 
(Will you ever forget your name or the alphabet?) And it has also been shown that 
imprinting can be reversed (Gaioni, Hoffman, De Paulo, & Stratton, 1978).

Interaction Models 

Although there have been some attempts to characterize imprinting as nothing 
more or less than ordinary learning (see Fabricius, 1962; James, 1959; Moltz, 
1960, 1963), these efforts have been no more successful than the attempts to 
argue that imprinting is completely different from ordinary learning. 

Still others, in what may prove to be the best approach, have proposed that 
imprinting involves the interaction of both learning and innate elements. For 
example, Hoffman and Ratner (1973) suggest that imprinted behavior is partly 
innate and partly learned. The first assumption they make is that certain classes of 
stimuli (for example, moving objects) innately elicit filial responses. If a stimulus 
drawn from this critical class is presented to an immature bird, the bird will 
approach the stimulus in an innate fashion, with no learning necessary. If the 
authors left their model at this point, we would have to predict that the bird 
would run about following any moving stimulus. But we know that this is not the 
case. The bird does not become imprinted on the whole class of stimuli but rather 
on one particular stimulus drawn from that class. This is where the authors bring 
in the notion of classical conditioning. Assume that the decoy is a moving, green 
football. The critical aspect of the decoy (the fact that it is a moving object) is the 
UCS. This UCS innately elicits a UCR (approach and follow). The CS is composed 
of those initially neutral aspects of the decoy that distinguish it from other 
moving objects (for example, green color, small size). Initially, it is only the 
moving quality of the object that elicits filial responses. The other attributes of the 
decoy (greenness, smallness) are, at first, neutral or ineffective in eliciting filial 
responses. But when the ball is presented to the duck, the filial responses are 
classically conditioned to those initially neutral attributes. Thus, in the future, the 
presentation of the ball will elicit filial behavior because of both the innate UCS 
(moving object) → UCR (approach) relationship, and because of the conditioned
CS (green, small) → CR (approach) connection. When the duck was presented a
choice between a man and the football, the combination of the innate and
conditioned filial tendencies would ensure a preference for the football.

[Figure 4.10: a continuum running from "extremely prepared" to "extremely unprepared."
Labels recoverable from the figure: innate behaviors (male stickleback attacks color
red); imprinted behaviors (duckling follows model); instrumental responses of varying
degrees of difficulty; pigeon flying to avoid shock; pigeon learning to key peck to
avoid shock; dog learning to yawn for food.]

Figure 4.10 The preparedness continuum as a dimension of survival mechanisms, with
some selected behaviors included.

The final fate of their model remains to be seen, but Hoffman and Ratner 
(1973) have attempted to clarify the ambiguous status of the imprinting concept 
by accounting for both innate and learned elements within the process, and their 
interpretation has received some support (see Gaioni et al., 1978). But their model 
is not free of difficulties. For instance, it is dependent on the assumption of a 
limited set of releasers. We have seen that this concept may be difficult to justify. 
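
The Hoffman and Ratner account described above can be sketched in a few lines of code. This is a minimal illustration, not their formal model: the class name `Duckling`, the strength values, the learning rate, and the simple update rule (each pairing moves a feature's strength a fixed fraction of the way toward 1) are all hypothetical choices made for the example. The feature labels for the man are likewise invented.

```python
from dataclasses import dataclass, field

INNATE_STRENGTH = 1.0  # hypothetical UCS -> UCR strength for any moving object
LEARNING_RATE = 0.3    # hypothetical fraction of remaining strength gained per pairing


@dataclass
class Duckling:
    # Conditioned (CS -> CR) strength for each initially neutral feature.
    conditioned: dict = field(default_factory=dict)

    def filial_tendency(self, moving: bool, features: set) -> float:
        """Innate component plus the summed conditioned components."""
        innate = INNATE_STRENGTH if moving else 0.0
        learned = sum(self.conditioned.get(f, 0.0) for f in features)
        return innate + learned

    def expose(self, moving: bool, features: set) -> None:
        """One pairing of the object's neutral features with the moving-object UCS."""
        if not moving:
            return  # no UCS present, so nothing is conditioned in this sketch
        for f in features:
            old = self.conditioned.get(f, 0.0)
            self.conditioned[f] = old + LEARNING_RATE * (1.0 - old)


football = {"green", "small"}  # neutral attributes of the decoy
man = {"tall", "upright"}      # hypothetical attributes of the alternative object

duck = Duckling()
before_football = duck.filial_tendency(True, football)
before_man = duck.filial_tendency(True, man)

for _ in range(5):             # five exposures to the moving football
    duck.expose(True, football)

after_football = duck.filial_tendency(True, football)
after_man = duck.filial_tendency(True, man)
print(before_football, before_man, after_football, after_man)
```

Before any exposure the two moving objects are equally attractive, because only the innate component is active. After exposure the football draws on both the innate and the conditioned components, whereas the man draws on the innate component alone, which is the preference the model predicts.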

Imprinting and the Preparedness Continuum 

Let us return to the main thread of the chapter. Not all responses are connected to 
all stimuli equally easily. How does imprinting relate to this theme? We have seen 
that it is difficult, if not impossible, to resolve the question of whether imprinted 
responses are innate, learned, or both. After reviewing the status of imprinting, 
many students, to say nothing of researchers in the field, are left with a certain 
sense of dissatisfaction. One way to lessen this uneasy feeling, or to gain perspective
on the issue, is to think about imprinting in relation to other behaviors on the
preparedness continuum. We can think of the preparedness continuum presented 
in Figure 4.10 as a dimension of survival mechanisms. All of the behaviors 
encompassed by the dimension are designed, or somehow function, to ensure the 
survival of the species. Some are more fixed, or predetermined, than others. 
Figure 4.10 contains a few sample behaviors. You may consider where additional 
behaviors might fall on the dimension. 

Purely innate behavior represents one extreme end of the dimension. Present a 
stimulus to the animal, and the response will occur automatically. Neither practice
nor reinforcement is necessary to establish this association. It seems as if the
connection is already there, or built into the animal’s nervous system. 

At the other end of the continuum we find events that just do not seem to be
learnable, no matter how much practice or reinforcement is involved. As noted,
for instance, it is almost impossible to teach a dog to yawn for food (Konorski,
1967).

Imprinting can be considered an instance of highly prepared learning. It does
not require much practice at all, nor is much, if any, reinforcement necessary for it 
to be established. The little duckling is ready to associate its filial responses with 
certain stimuli. The very fact that imprinting is so prepared actually leads to the 
confusion about whether it is innate or learned. By thinking of it as prepared 
behavior one can avoid some of this confusion. Rather than worrying about 
whether it is or is not learned behavior, we can merely think of it as very prepared 
behavior. Ethologists have traditionally examined behavior near the prepared end 
of the dimension, whereas learning psychologists have looked at less prepared 
behaviors. The controversy over innate versus learned behaviors may be a matter 
of degree rather than kind. 

It should be kept in mind that the preparedness continuum does not really 
explain anything. It is merely a frame of reference that enables us to put certain 
behaviors in perspective, to consider them in relation to one another, and to sort 
out our thoughts about how they might or might not differ. 

Humans and Imprinting 

Do humans imprint, or is the concept one that makes sense only in reference to 
“lower” organisms? Certainly, tiny children do not pad along in single file behind 
their parents. But very young children obviously do form strong social attachments
to their parents or caretakers. They do tend to straggle after a moving
parent, they sometimes react with fear when a strange adult approaches, and they 
approach their parents when offered a choice between a parent and a stranger. 

We tend to think of ourselves as possessing survival mechanisms that, when 
compared with those of the so-called “lower” animals, fall somewhere near the 
unprepared end of the dimension. That is, we tend to think of ourselves as more 
adaptive, more flexible, and less in the grip of rigid behavior patterns. But some 
authors have argued that we do possess characteristics reminiscent of imprinted, 
or at least highly prepared, behaviors. For example, our early socialization has 
been assumed by some to possess attributes of the imprinting process. Bowlby 
(1969) argues that human infants develop attachment behavior in a manner that 
is quite similar to the imprinting process observed in other mammals and in birds. 

Unfortunately, we really have little more than speculation and opinion concerning
the existence of imprinting in humans. But the question is exciting, and it
should generate a good deal of interesting research in the years to come. 

IMPLICATIONS FOR GENERAL LEARNING THEORY 
This chapter began with a discussion of the interchangeable-parts conception of 
learning, which has evolved gradually in American psychology. We then traced a 
number of areas of research that contradict this model of learning. These findings 
call into question the very notion that we may be able to arrive at a unified, 
general theory of learning. The questions we might ask are these: Do different 
degrees of preparedness require different laws, or different principles of learning? 
Does learning occur in a different manner at different points on the preparedness 
continuum? 

Seligman (1970) speculates that the laws of learning may very well vary with 
the continuum of preparedness. He writes: “Detailed studies which compare
directly the delay of reinforcement gradients, extinction functions, etc., for
prepared versus unprepared associations are needed. It would be interesting to find
that the extinction and inhibition functions for prepared associations were different
than for unprepared associations” (p. 416).

We are on a frontier of sorts. We have taken the issue of learning into the 
laboratory and have established, with some rigor, a set of general principles that 
seem to govern the establishment of associations in that setting. Now we are 
challenged by the notion that our rules and laws, developed under restricted 
laboratory conditions, may not apply to, or completely account for, learning 
accomplished by varied animals in the world outside the laboratory. New rules 
and laws may have to be developed for more diverse species, behaviors, and 
environmental settings. 


SUMMARY 

1. For the better part of this century many American psychologists have 
adopted a concept of learning that holds that all responses are about equally 
conditionable to all stimuli. 

2. This position is challenged by a growing body of literature, which indicates 
that the rule of equivalent associability is inadequate. 

3. Seligman has proposed a continuum of preparedness, which suggests that 
animals may be differentially prepared to associate certain events. 

4. Animal studies supporting the notion of a preparedness continuum come 
from the fields of instrumental conditioning, discrimination learning, and 
avoidance conditioning, among others. 

5. Humans, as well as other animals, are prepared to associate some events and 
unprepared to associate others. 

6. Bait shyness refers to the fact that, if an animal eats poisoned bait and 
survives, it will develop an aversion to the flavor of that bait. 

7. Although bait shyness possesses many of the characteristics of avoidance 
conditioning, one apparent difference is that the bait-shyness effect can be 
established with extremely long intervals (hours) between consumption and 
illness. 

8. Bait shyness supports the notion of a preparedness continuum, because it has 
been found that an aversive reaction will be associated only with certain 
aspects of the bait (for example, flavor). The animal is unprepared to associate
illness with other aspects of the bait situation (such as visual aspects).

9. Preparedness differences may well be related to the pressures of natural 
selection. 

10. The “novelty,” or newness, of a stimulus will determine whether it becomes 
the functional CS in aversive taste conditioning. 

11. In a practical sense bait shyness may be useful in the control of predation. 

12. Use of the bait-shyness effect resembles aversion therapy in humans. In 
aversion therapy the human is stimulated with a noxious agent while engaging
in or imagining the unwanted behavior (for example, smoking, overeating).

13. Species-specific defense reactions (SSDRs) are innate reactions that animals 
automatically produce in response to sudden or novel stimuli. Bolles argues 
that, if an avoidance response is easily acquired in the laboratory, it must 
necessarily be an SSDR.

14. Instinctive drift refers to the fact that after continuous conditioning innate
behaviors sometimes begin to intrude into conditioning situations. 

15. Ethologists have proposed the concepts of reaction-specific energy, innate 
releasing mechanisms, sign stimuli, fixed action patterns, vacuum reactions, 
and supernormal stimuli in their study of innate behavior. 

16. Tinbergen presents a hydraulic model of innate behavior. 

17. Imprinting refers to the fact that a newly hatched bird will approach and 
follow the first moving object it encounters. The distance the bird follows the 
object seems to be more important than the time of exposure in determining 
the strength of imprinting. 

18. Imprinting should be studied in natural settings as well as in the laboratory, 
because there are important differences between these two situations. 

19. Auditory imprinting and olfactory imprinting, as well as visual imprinting, 
have been observed. 

20. Imprinting may be a multiple-channel event. 

21. Some authors have argued, without total success, that imprinted behavior is 
completely innate. 

22. The concept of a critical period refers to the idea that imprinted social 
attachments will be formed only during a very brief period in the bird’s life 
soon after hatching. 

23. Additional experimentation has shown this assumption to be false. Once fear 
is removed, older birds form social attachments. Imprinting is not limited to 
a critical period and on this basis cannot be distinguished from learning. 

24. Some authors have argued that imprinting and learning call forth different 
responses to the presentation of a noxious stimulus. Analysis of the experimental
work indicates that this distinction between imprinting and learning
is also difficult to maintain.

25. Although some have argued that imprinting differs from learning in that it 
produces behaviors that are long lasting and irreversible, many learned 
behaviors also have these characteristics. 

26. Some authors have taken the opposite view, asserting that imprinted behavior
is learned. Currently there is no conditioning model of imprinting that
is entirely satisfactory. 

27. If innate behavior and learned behavior are thought of as continuous rather 
than dichotomous, much of the confusion concerning the nature of imprinting
evaporates. Imprinting can be thought of as highly prepared behavior
falling somewhere near the innate end of the preparedness continuum. 

28. Some authors have speculated that the process of early socialization in 
humans is similar to imprinting. 

29. Psychologists have taken learning into the laboratory and have established 
with some rigor a set of laws and principles that seem to govern behavior in 
that limited setting. The areas of research described in this chapter represent 
a challenge to that set of laws. Whether our laboratory-based laws will be 
sufficient to account for the diverse behaviors of diverse species in diverse 
environmental settings remains to be seen.

In the preceding chapters we were concerned primarily with the various learning
tasks employed by psychologists. We explored the relationships among these tasks
and the possibility of a general learning theory. In the following chapters we shall
deal with the specific issues and concerns of the learning process, rather than with
the general nature of learning as a whole.

There are several ways to approach the field of learning. One way is to consider, in
a sequential fashion, the works of influential theoreticians in the field. Good
examples of this reasonable strategy are provided by Bower and Hilgard (1981) and
Hill (1971). We shall take an alternative tack in which issues and problems, rather
than theories, are considered sequentially. We shall trace the issues that have been
of concern to all major thinkers in the field and refer to their works when relevant.
Hulse, Egeth, and Deese (1980) provide another example of this approach.

The next three parts of the text are labeled “Acquisition,” “Transfer,” and
“Retention.” A few comments concerning this breakdown of the field are in order.
When first faced with the field of learning, students are often perplexed by the
mass of data and interrelated issues. They have difficulty in grasping the limits of
the field and in some way making it comprehensible and manageable. One way to
ensure some grasp of the field is to think of it as reducible to these three
subprocesses. Almost any learning study we run across, whether reporting animal or
human learning, can be included in one of these categories. The major thrust of the
investigation is usually with one or the other of these processes.

Acquisition refers to those processes that occur during the establishment of an
association. When we deal with acquisition, we are concerned with events relating to
the building of associations, presumably through practice and reinforcement. If we
study how a rat increases its lever-pressing behavior for food, we are focusing on
the acquisition process. If we study the relationship between a child’s increasing
ability to read and the amount of praise given for each success, we are primarily
concerned with acquisition.

Transfer refers to the effects of one learned task on subsequent attempts to learn
or perform additional tasks. Positive transfer refers to the situation in which the
learning of one task facilitates the learning of additional tasks. If we observe
that children’s ability to memorize one set of vocabulary words is facilitated by
their having previously learned another set of words, we have demonstrated positive
transfer. Negative transfer refers to the situation in which prior learning hinders
present performance. We can see how important the concept of transfer must be in the
field of education. Educators want to be able to arrange classroom materials and
procedures so that positive transfer is obtained.

Retention refers to what happens to associations or learning after practice has
ceased. After associations have been acquired and are no longer being rehearsed,
what happens to them? Do they “remain with us” forever? Do they fade away? How
quickly? What factors determine the rate at which things are forgotten? As we ask
these kinds of questions and focus our attention on the fate of learned
associations, we are primarily concerned with the retention process.

Two notes of caution should be sounded about our approach. First, some studies are
obviously, and legitimately, concerned with more than one of these processes. For
instance, a given study may easily be concerned with the establishment of
associations and with their eventual fate once practice has ceased. Second, and more
importantly, the distinctions among acquisition, transfer, and retention are in fact
somewhat arbitrary and superficial. They should be taken as convenient, but not
necessarily crystal-clear, distinctions. For instance, acquisition is probably never
free of retention effects. Consider a paired-associate learning situation, such as
the learning of a list of Spanish-English equivalents. The subjects acquire the list
by repeatedly rehearsing it. Although the emphasis is on the acquisition process,
the situation is obviously not free from retention effects. The subject must
remember the materials from trial to trial. To the extent that she utilizes what she
learned on the last trial, the retention process is intertwined with the acquisition
process.

Similarly, transfer experiments always involve the retention process. For one task
to affect performance on a second task, either positively or negatively, it must be
remembered. Furthermore, retention experiments involve transfer processes. In fact,
retention experiments may represent a special case of transfer: the one in which the
two successive tasks are identical. The subjects learn a list of word pairs on
Day 1. On Day 2 they are given the stimulus words and asked to recall the response
words. We are looking at the effect of one list on the subsequent performance of the
very same list, or, in other words, a transfer situation in which the two successive
tasks are as identical as we can make them.

Without reviewing all the possibilities we can see that the processes of
acquisition, transfer, and retention are all interconnected. You might take a moment
to pursue some of the combinations. In what way do acquisition experiments almost
always involve transfer effects? How is acquisition involved in retention? How is
retention involved in transfer?

In Part Two some of the major issues in the area of acquisition are considered.
Traditionally, the issues of contiguity, practice, and reinforcement have maintained
central positions in this arena. In fact, major theoreticians most often disagree
about these three issues, or variables. Their roles in the acquisition process are
controversial. We shall see what is known about each by drawing on both animal and
human research.

The contiguity issue is taken up in the first part of Chapter 5. Must a response
occur in the presence of a stimulus for learning to occur? Or can an association be
established if the stimulus occurs at Time 1 and the response does not occur until
Time 2? Is stimulus-response contiguity necessary and sufficient for learning to
occur?

In the latter part of Chapter 5 we deal with the role of practice in learning. Is
practice, or rehearsal, necessary for learning to occur? What is the relationship
between rehearsal and strength of learning? The answer seems obvious. The more we
practice, the more we learn; but it is not quite so simple as it might appear. For
instance, some argue that increasing practice does not increase the strength of
learning. They argue, and quite persuasively, that learning occurs in an all-or-none
fashion. According to this controversial position, learning is either complete or
does not occur at all in a given trial.

In Chapter 6 we deal with the role 
of reinforcement in acquisition. Is 
reinforcement necessary for learning 
to occur? What happens to learning 
when we vary such factors as the 
amount and kind of reinforcement 
that we deliver to the animal? What 
are some of the current models, or 
conceptions, of reinforcement?

HOW IMPORTANT IS CONTIGUITY IN LEARNING?

One of the oldest assumptions in the field of learning is that temporal contiguity 
of the stimulus and response is necessary for learning to occur. It has generally 
been assumed that the response must occur in the presence of or very soon after 
the stimulus for an association to be established. In other words, only events that 
occur together in time can be associated. 

Intuitively, it seems extremely unlikely that two events will be associated if they 
are separated by any considerable length of time unless, of course, the subject 
somehow brings them together, either mentally or physically (see Rescorla & 
Cunningham, 1979). For example, there is no intuitive reason to assume that an 
event experienced today (for example, the sound of a hand clap) will in any way 
be associated with a particular response made tomorrow (speaking the word 
table). The fact that noncontiguous learning seems so unlikely may account for 
the contiguity principle’s being accepted so universally and uncritically. Few seem 
to doubt it. But believing something to be true is not the same as knowing it to be 
true, and so we shall revisit this issue. We shall challenge the law of contiguity by 
attempting to discover instances of noncontiguous learning in the experimental 
literature. If we can find some examples of noncontiguous learning, then the 
notion that contiguity is necessary for learning to occur must be modified. 

We can also ask whether stimulus—response contiguity (S—R) is sufficient for 
learning to occur. If a response occurs in the presence of a perceived stimulus, will 
learning always take place, without any other conditions (such as reinforcements) 
being met? Is contiguity all that is necessary? 

If experiments can be found that demonstrate the presence of learning in the 
absence of contiguity, then we will have some reason to believe contiguity is not 
necessary for learning to occur. But to answer the question of the sufficiency of 
S—R contiguity we have to conduct a different search. We have to turn things 
around to look for an absence of learning in the presence of contiguity. If we can 
find such situations, we shall have some evidence for the notion that overt S-R 
contiguity is not all it takes for learning to occur. Thus, two searches must be 
undertaken, one for noncontiguous learning (relevant to the necessity of contiguity)
and another for the absence of learning in the presence of S-R contiguity
(relevant to its sufficiency).

WHAT ARE STIMULI AND RESPONSES? 

Before we begin our dual search, we should consider the nature of stimuli and 
responses a little more closely. This text has adopted what is usually referred to as 
an S—R, or behaviorist, language. According to this particular way of speaking 
and thinking, learning refers to an increase in the tendency to behave, or respond, 
in a particular fashion when confronted with a particular stimulus situation. We 
refer to this increasing tendency to respond as the building, or strengthening, of 
an association between that particular response and that particular stimulus 
situation. The S-R language is not the only way to speak of learning. But it is
convenient, and it has been widely adopted. This text uses it for those reasons, not
because it represents the only way to understand the learning process. Like any
language it has its limitations, and its use must be kept in perspective.

Stimuli 

Logan (1970) defines a stimulus as “any adequate change in energy falling upon
an appropriate sensory receptor” (p. 8). In other words, any detected energy
change can serve as a stimulus. For example, lights, sounds, tastes, odors, and 
shock can serve as stimuli. The offset of any of these can be as effective as their 
onset. Relationships can serve as stimuli. An animal can learn to respond to the 
brighter of two stimuli, regardless of their absolute values. Even responses can 
sometimes function as stimuli. When we speak and hear ourselves, for instance, 
the sounds can serve as stimuli for further responses. If we detect that we are 
speaking too loudly, we may respond by speaking more softly. In addition, we use 
the sound of our own voice to keep track of where we are in our sentences. 

Presumably, other types of action are equally affected by the stimulus qualities 
of our responses. For instance, if gymnasts did not receive sensory input, or 
feedback, from the body as they went through their complicated routines, they 
would probably lose track of what they were doing. Visual cues and the like 
would help, but they would not be sufficient. We would hardly expect a top-notch 
performance from athletes who were dependent on seeing their legs go over the 
bar without also feeling it happen. 

Thinking probably represents another case in which responses serve as stimuli. 
One “thought” leads to another. One cognitive action, or response, serves as a 
cue for the next. 

Any detected event that is distinguishable from other events can serve as a 
stimulus. It should always be kept in mind, however, that not all energy changes 
are equally effective as stimuli. For instance, a pigeon will readily associate illness 
with visual stimuli, but not so readily with taste. Rats, on the other hand, readily 
associate illness with taste but not with visual stimuli. An effective stimulus for 
one species may not be so for another (Garcia & Koelling, 1966). 

In addition, if we are not paying attention to a particular energy change, it 
tends not to be used as a stimulus. Attention is an important, but poorly defined, 
concept in psychology (see Bourne, Dominowski, & Loftus, 1979). At any given 
moment our sensory receptors are being bombarded by a large number of energy 
changes, each of which has the potential to be a stimulus. But somehow we are 
not aware of all of them at any given moment. We channel our attention, focusing 
on only a few of these many potential stimuli. As you read these words, for 
instance, you are attending to the print. You are probably not attending to your 
hands, even though they are part of the total stimulating pattern. You can attend 
to them if you wish. Without moving your eyes away from the printed words you 
can be aware of their position, their shape, and their movement. If an energy 
change is to be effective as a stimulus, our attention must be directed toward it. 

And finally, the continuation of a stimulus is not dependent on the continued 
stimulation of sensory receptors by some outside energy source. Once an external 
stimulus has activated sensory receptors, the message is “taken into” the organism
and maintained therein. Traces of external stimuli, or the neural remainders
of external stimuli, can take various forms and can persist for varying amounts of 
time. 

The term stimulus trace has been used in several ways. On the one hand, it has 
been used to refer to essentially sensory phenomena. For instance, we are all 
familiar with afterimages. As children we discovered that if we stared at a 
lightbulb for some time and then looked away, we could still see something that 
resembled the bulb. This afterimage, or trace, fades rather quickly. It represents 
one instance in which a terminated external stimulus “lives on” within the
organism. 

The afterimage represents a stimulus trace that decays rather rapidly, but there 
are some traces that may persist indefinitely. Consider what you might do if an 
attractive person gave you a phone number in a crowded, noisy situation. You 
might desperately rehearse it, hoping to preserve the trace until you could write it 
down or until it became part of your permanent memory. The entire field of 
memory and retention is, in a sense, concerned with stimulus traces. When we 
remember something, we are often dealing with the trace of a long-gone external 
stimulus. 

Critics of the S—R approach have argued that it is impossible to identify the 
effective stimulus in a learning situation. What, for example, is the stimulus for 
laughing at a joke? What is the stimulus for reaching a creative solution to a 
problem? What is the stimulus for learning to swim? Some feel that the S—R 
conception, with its emphasis on discrete, identifiable stimuli, is simpleminded. It 
may well be. Life is obviously a good deal more complicated than a lot of stimuli 
and responses. The criticism is just, and it is not brushed aside easily. 

Yet several points can be made in defense of the S-R position. First, just 
because we cannot always identify the functional stimulus does not mean that we 
never can. There are very many situations in which the most important stimuli 
can be identified easily. Second, just because we cannot always observe the 
effective stimuli directly does not necessarily mean that there are no such stimuli. 
It may merely be beyond our technical ability to measure or observe them at the 
present time. Third, the S-R language, although imperfect, does provide a structure
within which to think of the complicated world of learned events.

Responses 

It is as difficult to define a response as it is to define a stimulus in our S—R 
language. Let us say a college student, wanting to get acquainted with one of his 
classmates, asks her if she will join him for a cup of coffee. To his inquiry the 
woman responds with a flat no. That no is undoubtedly a “response,” but it is 
more difficult to determine where the response began and where it ended. As she 
listened to the man’s question, she apparently experienced some kind of emotional
reaction. Her emotional reaction was then somehow translated into a verbal
response. But the response does not even end there. Presumably neither she nor he 
would immediately return to a state of nonresponding. In all likelihood some sort 
of uncomfortable disengagement process would ensue. Even after they had gotten 
away from each other, the chain of behaviors would not end. They both would 
probably think about the encounter for some time. The situation obviously 
involves a chain, or series, of complex actions and reactions. 

We must now try to handle this enormously complicated situation with our 
S—R conception of behavior. We must do two things in order to isolate a distinct 
response from this “behavior chain.” First, we acknowledge the fact that behavior 
is continuous. We are always doing something; we never stop behaving. We may 
be more or less active, but we are always behaving. For example, we behave even
when we are asleep: We breathe, we produce antibodies, we jump at the tiny 
suspicious sound of a window being rattled, but we remain undisturbed when a 
truck goes by. Many of us keep track of time, so that we awaken more or less at
the time we wish to awaken. 

Second, having acknowledged that behavior never ceases, we can define a 
response as any identifiable segment of this continuous behavior process. As 
Logan (1970) puts it, “A response is any glandular secretion, muscular action, or 
other objectively identifiable aspect of the behavior of an organism” (p. 25). A 
response can be anything from a muscle fiber twitch to the complex, goal-oriented 
behavior of the whole organism. The key here is that the segment must be 
objectively identifiable. Scientists must be able to agree that a given segment of 
behavior has occurred. If they cannot, they have gotten nowhere. For example, 
psychologists would probably not agree that our female friend had rejected the 
male. They could agree that she had responded with a verbal rejection, but they 
might not agree whether she meant it or not. Further observations would have to 
be made before agreement could be reached. 

In summary, we accept the notion that behavior is not discrete. It is a continuing
process from which we select segments to study. They may be short
segments (for example, pronouncing the letter T, eye blinks) or long segments 
(success in college, the development of political attitudes). We then attempt to 
devise methods of measurement that will ensure that we can all agree whether our 
chosen behavior segment has or has not occurred. The psychology of learning is 
the study of how these behavior segments become more and more likely to occur 
in the presence of particular stimuli. 

IS CONTIGUITY NECESSARY? IS IT SUFFICIENT? 

To return to the main thread of this chapter, we now begin our search for 
experiments that will disprove, or at least require modification of, the contiguity 
principle. We begin by looking for learning that occurs in the absence of S—R 
contiguity. (Later we shall seek situations in which learning does not occur even 
though S—R contiguity is established.) If we can convince ourselves that we have 
some clear examples of noncontiguous learning, then we may want to reconsider 
the contiguity principle. 

Subjective Organization and Contiguity 
The first of several areas that we will review has to do with the concept of 
subjective organization (SO) (Tulving, 1962, 1964, 1966). In the simplest kind of 
SO experiment a randomly arranged list of unrelated words is presented, one 
word at a time, to the subjects. The subjects then attempt to recall as many of the 
words as they can, in any order they wish. The list then is presented to the 
subjects again, but this time in a new random order. The subjects again attempt to 
recall the words. The experiment involves a series of these successive phases of 
presentation and recall. 

Tulving noticed that the subjects began to recall certain items next to each 
other, or sequentially, even though the items were scattered throughout the 
random presentation orders. The number of pairs the subjects began to recall 
sequentially increased as the experiment progressed. Each subject began to develop
his or her own unique sets of items that were consistently recalled together.
It seemed that the subjects somehow organized the materials “inside their head.” 
The fact that items are consistently recalled together in the absence of temporal
contiguity during the input phase suggests that we may have found an instance of 
noncontiguous learning. Two items do not occur together in time during the 
presentation phase, yet they are consistently recalled together. Some kind of 
association has developed between them. Where is the temporal S-R contiguity? 
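
The tally of items recalled next to each other on successive trials can be sketched in a few lines. This is only a simplified count of repeated adjacent pairs, not Tulving's own SO statistic, and the function name and the word lists are hypothetical illustrations.

```python
def repeated_adjacent_pairs(recall_a, recall_b):
    """Count unordered pairs of items recalled side by side on both trials."""
    def adjacent_pairs(recall):
        # Each neighboring pair is stored without regard to order.
        return {frozenset(pair) for pair in zip(recall, recall[1:])}
    return len(adjacent_pairs(recall_a) & adjacent_pairs(recall_b))

# Hypothetical recall protocols from two successive trials.
trial_1 = ["barn", "fiction", "lamp", "river", "coin"]
trial_2 = ["river", "fiction", "barn", "coin", "lamp"]
print(repeated_adjacent_pairs(trial_1, trial_2))
```

A count that rises across trials, even though presentation order is rerandomized each time, is the pattern described above.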

There are at least two possible answers to this question. First, it may well be 
that the subjects merely bring items together “in their own mind.” That is, even 
though the items are never presented together externally, there is nothing to 
prevent the subjects from carrying traces of the items for long periods. Such an 
argument is certainly reasonable, but there are few hard data to support it. This 
hypothesis may well be correct, but we would not want to accept it without some 
objective demonstration of its validity. 

Another analysis suggesting that S—R contiguity occurs in SO experiments has 
to do with the output, or recall, phase of the experiment (Wallace, 1970). In many 
SO experiments the subjects are asked to write down all the words they can 
remember on a sheet of paper. In addition, they are typically given several minutes 
to attempt recall on each trial. Suppose that one subject randomly recalls and 
writes down barn and fiction together on the very first trial. Let us assume that 
there is no association between them at this time. There they are, right in front of 
him. He knows he is going to have to try to recall the list again. He rapidly 
rehearses the items he has already recalled, hoping to preserve them for the next 
trial. By looking at barn and fiction together he is, in effect, experiencing S-R 
contiguity. They occur together in time after all. We can see how SO might build 
up over trials. Two items randomly recalled together on the first recall trial would 
be rehearsed as a pair. The resultant association between them would lead to a 
greater likelihood that they would be recalled together on future trials, and so on. 

Wallace (1970) presents data that support this output-rehearsal hypothesis. He 
found that amount of SO decreased when he set up experiments so that rehearsal 
during the output phase was reduced. For instance, he found that SO decreased 
when he substituted oral recall for written recall. In oral recall S—R contiguity is 
much more fleeting and less stable than in written recall. He found that SO 
decreased when the subjects were required to write down successively recalled 
items in noncontiguous positions on the recall sheet. The scattered positions of 
successively recalled items presumably reduced the probability that they would be 
rehearsed in a contiguous manner. 

In summary, what appeared to be noncontiguous learning may be the result of 
either “mentally” established contiguity, or S—R contiguity established during the 
recall phase of the experiments. Subjective-organization experiments do not, after 
all, provide us with a clear instance of noncontiguous learning. 

Paired-Associate Learning and Contiguity 
In paired-associate (PA) experiments temporal S—R contiguity is the rule rather 
than the exception. The experimenter generally ensures that the response will 
occur in the presence of the stimulus. Each time through the list, the stimulus term 
is normally presented alone for 2 seconds. The stimulus and the response terms 
are then presented together for 2 seconds. Hence, PA research is not an ideal place 
to look for learning in the absence of S-R contiguity. 

There have been a few studies, however, in which the time between the PA 
stimulus and the PA response has been varied. In these studies the stimulus is first
presented alone. Then there ensues an empty time interval, which is varied in 
length. Finally, the response term is presented alone. The question is whether 
learning will occur when the stimulus and the response are separated in time. 
Guthrie (1933) and Martin and Schultz (1963) employed intervals ranging from 2 
to 6 seconds. Interestingly, they found better learning with longer intervals. These 
results would seem to be in conflict with the contiguity principle. The greater the 
separation of the S and the R in time, the better is the learning. The reason for this 
seeming contradiction soon became apparent. Martin (1966) attempted the same 
type of experiment, but with one major modification. He had the subjects count 
backward by 3s during the intervals. Under these conditions the effect reported by 
Martin and Schultz (1963) could not be replicated. In other words, the increase in 
learning with increases in the interval in the earlier experiments must have had 
something to do with what the subjects did during that interval. In the earlier 
experiments subjects were left free to rehearse or to carry the stimulus trace 
forward in time, and their performance improved as the interval increased. In 
Martin’s (1966) experiment the subjects were prevented from rehearsing or carrying
the stimulus trace forward, and their performance did not improve as the
interval increased. Counting backward by 3s presumably distracted the subjects 
from the task at hand and prevented them from utilizing the time in rehearsal or 
maintaining a stimulus trace. 

The ideal experiment investigating S—R contiguity in the PA situation would 
involve not only temporal separation of the stimulus and the response but also 
control of the subject’s activities during the interval. Nodine (1969) ran an 
experiment with these qualities in mind. The subjects learned PA lists with 
intervals ranging from none to 6 seconds. In addition, Nodine varied the interval 
activities of the subjects. In some conditions subjects named numbers in an 
attempt to minimize rehearsal and the carrying forward of the stimulus. In other 
conditions subjects were asked to pronounce the stimulus over and over again 
during the interval, thereby maximizing mediated S-R contiguity. In a final set of 
conditions the subjects were free to do as they wished during unfilled intervals. 
Nodine’s analyses suggest that both S—R contiguity and rehearsal within the 
interval are important. The two factors appear to operate against each other. On 
the one hand, as the interval increases, there is a decrease in the tendency for an 
association to be formed between the stimulus and the response. This is the 
contiguity principle in action. On the other hand, if the subject is not prevented 
from rehearsing or carrying the stimulus trace forward, then there is an increase 
in the tendency for an association to be formed as the interval increases. 

In summary, these studies suggest that, unless the stimulus is closely followed 
by the response or unless the subject can carry the stimulus trace forward in time, 
there is little chance that learning will occur. Once again the necessity of contiguity
is upheld.


The CS—UCR Interval and Contiguity 

In Chapter 2 we saw that a CS-UCS interval of about .5 of a second is often 
optimal for classical conditioning. Longer and shorter intervals result in poorer 
conditioning. Generally speaking, these facts tend to support the contiguity 
principle: For learning to occur, a response (the UCR elicited by the UCS) must 
occur at the same time as, or very soon after, the stimulus (the CS). But if there are
any exceptions to this rule, then we may be able to think about them in terms of 
noncontiguous learning. 

Revusky (1968) ran an interesting bait-shyness experiment in which a sweet 
solution (the CS) was ingested at a given time. Seven hours later the rats were 
X-irradiated and became ill (the UCS-UCR sequence). Despite this exceedingly 
long CS-UCR, or stimulus-response, interval, an aversion to the sweet flavor 
developed. Many investigators have obtained similar results (Garcia, Ervin, &
Koelling, 1966; Smith & Roll, 1967).

This result does seem to come closer than the other types of experimentation 
we have discussed to demonstrating noncontiguous learning. But some have 
argued persuasively that learning over long CS—UCR intervals does not represent 
noncontiguous learning. In presenting their argument they evoke the idea of trace 
conditioning. Some kind of lingering trace of the CS (sweet flavor) might bridge 
the long CS-UCR interval (see Kehoe, Gibbs, Garcia, & Gormezano, 1979).

Two types of trace might be operating here. First, we might consider actual 
physical traces of the stimulus, such as chemicals lingering in the mouth or other 
parts of the system. Although such physical traces might bridge the time gap, 
Garcia, McGowan, and Green (1972) argue against such a possibility. They 
report that extremely transient tastes were as effective in ensuring conditioning as 
stronger, more lingering tastes. If a physical trace had mediated the learning, we 
would have expected the more durable physical traces to be more effective, but 
they were not. 

A more likely trace interpretation is based on the mechanisms of memory. That 
is, the memory of the CS, or the neurological trace of the physical stimulus, is 
much more likely to mediate the time gap than is an actual physical trace of the 
stimulus. As mentioned earlier, the more novel a CS, the more likely it is to 
become an effective stimulus (see Dawley, 1979; Garcia & Rusiniak, 1979). This
novelty effect seems to support the memory interpretation, because a novel CS is 
probably more likely to be remembered than a less novel one. Although some 
investigators (Garcia, McGowan, & Green, 1972) feel that mediators cannot
account for the full effect, these kinds of potential “mental” mediators have not 
been eliminated from the methodology of these experiments. In conclusion, 
noncontiguous learning has not been demonstrated conclusively. The most 
reasonable conclusion to draw at present is that S—R contiguity is extremely 
important, if not actually necessary, for the establishment of associations. 

Contiguity and CS “Blocking” 

So much for the question of the necessity of contiguity in learning. We now turn 
to our second major search—that having to do with whether S—R contiguity is 
sufficient for learning to occur. If a response occurs in the presence of a stimulus, 
will an association be formed between the two, or must other conditions be met 
for learning to take place? The simplest sort of search is to look for experiments 
that fail to show any learning occurring even though S—R contiguity exists. 

We have already visited Kamin’s blocking effect in Chapter 2 (see Cheatle &
Rudy, 1978; Haggbloom, 1983; Kamin, 1969b; Kohler & Ayres, 1979; Rescorla
& Colwill, 1983). But the effect is relevant to the issue of the sufficiency of
contiguity. As you will recall, a blocking experiment often involves two groups. 
One group is presented with a compound stimulus (designated AX) paired with a
UCS. A second group, before receiving identical treatment, is given pretraining, 
during which the A component of the compound stimulus is paired with the UCS. 
Then, following AX-UCS training, X is tested alone. It is found that X is more 
likely to elicit a CR when the subject did not have prior training with the A 
component alone. But when you think about it for a moment, it becomes 
apparent that X was paired with the UCS (and therefore with the UCR) the same 
number of times in both groups. Stimulus—response contiguity was established 
equally in both groups, and yet learning was not equal. This effect suggests that 
there must be something more to learning than S—R contiguity. If S—R contiguity 
was a sufficient condition for learning to occur, then X should have become an 
equally effective CS in both groups. 
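
One way to see how equal contiguity can produce unequal learning is to simulate the two groups with the Rescorla and Wagner (1972) rule, in which each trial changes the strength of every cue present in proportion to the difference between the maximum strength the UCS will support and the summed strength of the cues present. The sketch below is a toy simulation under assumed values: the learning rate, the asymptote, and the 20 trials per phase are arbitrary choices, not figures from any of the experiments cited.

```python
def rescorla_wagner(phases, rate=0.3, asymptote=1.0):
    """Return the associative strength of each cue after the given phases.

    phases: list of (cues, n_trials); every trial is reinforced by the UCS.
    rate and asymptote are assumed, illustrative values.
    """
    strength = {}
    for cues, n_trials in phases:
        for cue in cues:
            strength.setdefault(cue, 0.0)
        for _ in range(n_trials):
            # The prediction error is shared by all cues present on the trial.
            error = asymptote - sum(strength[cue] for cue in cues)
            for cue in cues:
                strength[cue] += rate * error
    return strength

# Control group: AX-UCS training only.
control = rescorla_wagner([(("A", "X"), 20)])
# Blocking group: A-UCS pretraining, then identical AX-UCS training.
blocked = rescorla_wagner([(("A",), 20), (("A", "X"), 20)])

print(round(control["X"], 3), round(blocked["X"], 3))
```

In both groups X accompanies the UCS on exactly 20 trials, yet X ends with far less strength in the pretrained group, because A already predicts the UCS and leaves little error for X to absorb.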

Informational Theory 

In Chapter 2 we also described briefly some recent informational theories of 
classical conditioning (see Mackintosh, 1975; Rescorla & Wagner, 1972).
According to these theories, a stimulus becomes an effective CS only when it 
provides the subject with information about the occurrence of a forthcoming 
UCS. What is the relevance of this for the question of the sufficiency of S-R 
contiguity? These theories argue that, although S-R contiguity may be necessary, 
it is certainly not sufficient for learning to occur. 

Consider the kind of experiment that these investigators have reported. In one 
condition a CS is paired with a UCS a given number of times. In another 
condition the same CS is paired with the same UCS the same number of times. 
(S—R contiguity is identical in both conditions.) But in the second condition the 
UCS is also presented equally often without the CS. That is, the UCS is as likely to 
occur without the CS as with it. In other words, the occurrence of the CS in the 
second condition tells the subject nothing about the occurrence of the UCS. 

If S—R contiguity were sufficient for learning, then the CS in both conditions 
should become effective, because S—R contiguity is identical. But that is not at all 
what happens. The CS only becomes effective (starts eliciting a CR) in the first 
condition, where it predicts the occurrence of the UCS. (See Spence, Haggard, and 
Ross [1955] for more about this issue.) 
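
The difference between the two conditions can be made concrete by comparing the probability of the UCS when the CS is present with its probability when the CS is absent. The sketch below uses that difference as a summary. The measure, the function name, and the trial counts are our own illustrative assumptions rather than values taken from the experiments.

```python
def contingency(intervals):
    """P(UCS | CS) minus P(UCS | no CS).

    intervals: list of (cs_present, ucs_present) booleans, one per time interval.
    """
    with_cs = [ucs for cs, ucs in intervals if cs]
    without_cs = [ucs for cs, ucs in intervals if not cs]
    return sum(with_cs) / len(with_cs) - sum(without_cs) / len(without_cs)

# Condition 1: ten CS-UCS pairings; the UCS never occurs without the CS.
condition_1 = [(True, True)] * 10 + [(False, False)] * 10
# Condition 2: the same ten pairings, but the UCS is just as likely without the CS.
condition_2 = [(True, True)] * 10 + [(False, True)] * 10

print(contingency(condition_1), contingency(condition_2))
```

The number of CS-UCS pairings is the same in both lists, so contiguity is matched. Only in the first does the CS carry information about the UCS.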

Contiguity and Short-Term Memory 

Glenberg and associates (Glenberg & Adams, 1978; Glenberg & Bradley, 1979;
Glenberg, Smith, & Green, 1977) have reported data from short-term-memory 
situations that bear directly on the question of the sufficiency of S—R contiguity. 
Basically, their experimental situation involves the following: Subjects are first 
given a four-digit number and instructed to remember it. Pairs of words are then 
presented, and the subjects are asked to repeat the pairs a prescribed number of 
times. The subjects are then asked to recall the four-digit number. They are led to 
believe that the word pairs are simply “fillers” and that recall of these words will 
not be asked for. Repeating the word pairs is merely supposed to prevent the 
subjects from rehearsing the critical four-digit number. In other words, the 
subjects believe that they are not supposed to try to learn the word pairs. But 
then, following the test for recall of the four-digit numbers, the subjects are given 
a surprise test for the word pairs. If S-R contiguity is sufficient for learning, these 
word pairs should be learned. But they are not. In one experiment the subjects
were given one member of each pair and asked for the second member. Only two 
correct responses were obtained in 1080 tries. In other words, virtually nothing 
had been learned even though S—R contiguity had been established. We know 
that the subjects attended to the word pairs, because they vocalized them. But still 
no learning occurred. In short, S—R contiguity was not sufficient for learning to 
occur. 

A review of the available literature leads us to conclude that S-R contiguity is
extremely important, if not actually necessary, for learning to occur. But S—R 
contiguity does not appear to be sufficient for learning to occur. Other factors, 
such as reinforcement, intent, motivation, and information, which we will explore
in upcoming chapters, appear to be essential to the learning event.

THE ROLE OF REHEARSAL IN LEARNING 

Having completed our review of the role of contiguity in learning, we now turn to
the second major topic of this chapter, that of the role of practice, or rehearsal,
in learning. Does rehearsal enhance learning? If so, how? What happens to the 
strength of the association as we increase the frequency with which the response 
occurs in the presence of the stimulus? Intuitively, the answer seems simple: The 
more we practice, the more we learn. The more that athletes practice their 
particular sport, the better they will be at it. The more we study, or rehearse, 
materials for an exam, the better we know them—up to a point, of course. Once 
we have completely mastered the materials, further rehearsal will lead to little 
additional gain. 

Our intuitive sense of the world argues that the relationship between rehearsal 
and learning is positive, with continued practice leading to continued gains until 
the task is mastered. But psychological research and thinking have suggested that 
the relationship between learning and rehearsal may not be so simple. There is 
some question about the role of rehearsal in the learning process. Some psychologists
argue that our intuitive sense is correct, and that learning is gradual, like
stage lights coming up slowly in a theater. Other psychologists have argued that 
learning occurs in an all-or-none fashion, like a kitchen light being snapped on 
abruptly. According to the all-or-none position, learning occurs completely in one 
trial, or it does not occur at all. 

At first glance the all-or-none interpretation does not seem to make sense. The 
data from classical conditioning, instrumental conditioning, discrimination learning,
free-recall learning, and just about any other type of learning situation
suggest that increases are gradual, cumulative events. There are few dramatic rises 
from zero performance to complete performance in one trial. How does the 
all-or-none position explain these kinds of data? For now, think of it like this: 
When you sit down to study for the examination in this class, it will appear that 
your mastery of the material is gradual. Your overall grasp of the content of the 
class will increase slowly through the agonizing hours. And yet, even though it 
appears that your grasp of the material is slowly becoming stronger, any given bit 
of learning may occur in an all-or-none fashion. You may all of a sudden hook up, 
or associate, the name Garcia with bait-shyness. You may in an all-or-none 
fashion connect the label backward conditioning with the notion of the UCS 
coming before the CS. Similarly, a golfer’s game might gradually improve over the
years, but this improvement might be the result of lots of little bits of learning that 
occur in an all-or-none fashion. One day the golfer might abruptly learn to keep 
her head down. The next week she might abruptly learn to follow through 
properly, and so on. The appearance of gradual improvement might be the 
summed result of many little, discrete bits of learning that occur in an all-or-none 
fashion. This issue is an interesting one, and we will return to it in later sections of 
this chapter. 
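
The idea that a smooth overall curve can be the sum of many abrupt, all-or-none events is easy to check with a small simulation. In the sketch below each item is either fully learned or not learned at all. The number of items, the per-trial learning probability, and the random seed are arbitrary assumptions chosen only for illustration.

```python
import random

def all_or_none_curve(n_items=200, n_trials=30, p_learn=0.2, seed=0):
    """Proportion of items learned after each trial when every item is all-or-none."""
    rng = random.Random(seed)
    learned = [False] * n_items
    curve = []
    for _ in range(n_trials):
        for i in range(n_items):
            # An unlearned item jumps from nothing to fully learned in one step.
            if not learned[i] and rng.random() < p_learn:
                learned[i] = True
        curve.append(sum(learned) / n_items)
    return curve

curve = all_or_none_curve()
print([round(value, 2) for value in curve[:5]])
```

No single item is ever partly learned, yet the proportion of learned items climbs gradually over trials, much like the gradual curves that seem at first glance to contradict the all-or-none position.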


LEARNING CURVES 

A learning curve is a visual representation of performance over a series of trials, 
or time intervals. In Chapter 3 we saw that the relationship between performance 
and practice is typically summarized by a negatively accelerated learning curve 
(see Figure 3.5). That is, as practice progresses, performance increases rapidly at 
first and then levels off. This negatively accelerated curve is a widespread phe¬ 
nomenon appearing in many diverse areas of investigation. But it is not the only 
curve that appears in our research. Even though the negatively accelerated curve 
is very characteristic, we do not want to fall into the trap of thinking of it as “the” 
learning curve that represents some irrefutable mathematical law of nature. 
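
One simple way to generate a negatively accelerated curve is to let each trial close a fixed fraction of the gap between current performance and some upper limit. The sketch below does just that. The particular rule, the limit of 100, and the rate of .2 are assumptions made for illustration, and the rule is only one of many that yield this shape.

```python
def negatively_accelerated(n_trials, limit=100.0, rate=0.2):
    """Performance after each trial when every trial closes a fixed share of the remaining gap."""
    level = 0.0
    curve = []
    for _ in range(n_trials):
        level += rate * (limit - level)
        curve.append(level)
    return curve

curve = negatively_accelerated(10)
# Gain produced by each successive trial.
gains = [after - before for before, after in zip([0.0] + curve, curve)]
print([round(gain, 1) for gain in gains])
```

The gains shrink from trial to trial, which is what is meant by performance increasing rapidly at first and then leveling off.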

Learning Curves Are Not Learning Curves 
Learning curves are not really learning curves at all; they are performance curves. 
That is, they are the result of how much the organism has learned and how 
motivated it is. If you recall, performance refers to the activation of learning 
through the involvement of motivation. In a sense, we can say that Performance
= Learning × Motivation. To infer the amount of learning that has occurred we
must be able to measure performance and estimate the organism’s level of 
motivation. 

If we observe one rat rapidly running down a straight alley to food, for 
example, and a second rat walking rather aimlessly along in the same apparatus, 
we cannot immediately conclude that the first rat has learned more than the 
second rat. It may just be more motivated. We can see how errors could be made 
in estimating the amount of learning. If we were not aware that the first rat was 
hungrier than the second, we might incorrectly conclude that the first rat had 
learned more. There are experimental designs that help us to decide whether 
performance differences result from learned or motivational factors. These designs
will be discussed in the section in Chapter 6 concerned with amount of
reinforcement. 
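
The multiplicative relation makes the inference problem easy to demonstrate. In the sketch below the numbers assigned to learning and motivation are hypothetical values on an arbitrary 0-to-1 scale. They are meant only to show that the same performance can arise from different amounts of learning.

```python
def performance(learning, motivation):
    """Performance = Learning x Motivation, on arbitrary 0-to-1 scales."""
    return learning * motivation

def inferred_learning(observed_performance, motivation):
    """Recover learning from performance once motivation has been estimated."""
    return observed_performance / motivation

# Hypothetical rats: one has learned little but is very hungry, the other the reverse.
hungry_rat = performance(learning=0.5, motivation=0.8)
sated_rat = performance(learning=0.8, motivation=0.5)

print(hungry_rat, sated_rat)
print(inferred_learning(hungry_rat, 0.8), inferred_learning(sated_rat, 0.5))
```

The two rats perform identically. Only after motivation has been estimated can the difference in learning be recovered.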


Performance Measures 

Given that the amount of learning is inferred from some index of performance, 
what measures of performance are available? How do we measure the performance
of a rat running down a straight alley for food? We could do it in any
number of ways. We could measure its running speed; the faster it runs, the more 
we assume it has learned. We could measure the latency of its response; that is, we 
could measure how long it remains in the starting box before it starts off down 
the alley. The quicker it leaves the box, or the shorter the latency, the more 
learning we would infer. We could use resistance to extinction by removing food 
from the situation. The longer the rat persists in running down the alley to an
empty goal box, the more learning we would want to infer. 

If we consider something like paired-associate learning, we run into a number 
of different performance measures. For example, the amount of PA learning is 
often estimated by counting the number of correct responses that the subject 
produces in a given number of trials. The parameter trials to criterion is also used 
as a measure of performance. The experimenter counts the number of times the 
subjects must go through the list before they meet some predetermined criterion 
(such as once, or perhaps twice, through the list without an error). The availability
of all these performance measures creates problems. We will now examine a
few of them. 
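
The trials-to-criterion measure is simple to compute from a record of errors on each pass through the list. The sketch below is one possible scoring routine. The function name and the error record are hypothetical.

```python
def trials_to_criterion(errors_per_trial, perfect_runs=1):
    """Return the trial on which the criterion is met, or None if it never is.

    perfect_runs: number of consecutive errorless passes through the list required.
    """
    streak = 0
    for trial_number, errors in enumerate(errors_per_trial, start=1):
        streak = streak + 1 if errors == 0 else 0
        if streak == perfect_runs:
            return trial_number
    return None

# Hypothetical record: number of errors on each pass through a PA list.
errors = [5, 3, 1, 0, 2, 0, 0]
print(trials_to_criterion(errors, perfect_runs=1))
print(trials_to_criterion(errors, perfect_runs=2))
```

The same subject meets a once-through criterion on Trial 4 but a twice-through criterion only on Trial 7, so the choice of criterion affects the score that is reported.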

RELIABILITY OF THE MEASURE. Suppose that psychologists wish to study the 
effects of smoking marijuana on driving an automobile. They have their subjects 
puff away and then observe their ability to drive through a carefully constructed 
driving course. The psychologists find, relative to some appropriate control, that 
the subjects show poor driving ability. Though tempted to conclude that smoking 
marijuana hurts driving ability, the experimenters must first ask themselves 
several questions. One of these has to do with the reliability of their performance 
measure. The measure must yield similar results on different occasions. If the 
experimenters ran the same subject through the same test on several different 
occasions, they should obtain similar results. If they do, then they are using a 
reliable measure. If they do not, then their measure is unreliable, and their 
conclusions are invalid. 
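
One common way to check reliability is to test the same subjects twice and correlate the two sets of scores. The sketch below computes a Pearson correlation for that purpose. The choice of statistic is our assumption, since the passage above does not name one, and the driving-course scores are invented for illustration.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation between two equal-length lists of scores."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    covariation = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sum((x - mean_x) ** 2 for x in xs)
    spread_y = sum((y - mean_y) ** 2 for y in ys)
    return covariation / sqrt(spread_x * spread_y)

# Hypothetical error counts for five drivers tested on two occasions.
first_session = [12, 7, 15, 9, 4]
consistent_retest = [11, 8, 14, 10, 5]
erratic_retest = [6, 13, 5, 12, 9]

print(round(pearson_r(first_session, consistent_retest), 2))
print(round(pearson_r(first_session, erratic_retest), 2))
```

A correlation near 1 indicates that the measure orders the subjects in much the same way on both occasions. A low or negative correlation indicates an unreliable measure.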

SENSITIVITY OF THE MEASURE. The top half of Figure 5.1 contains some 
hypothetical data produced by two rats in a straight runway situation. In each 
case the rats were fed when they reached the goal box at the end of the runway. 
Rat 1 was given 30 trials and then taken out of the testing situation. At that time 
it was running at a constant speed. Its performance had leveled off. After 30 trials 
Rat 2 was running at the same constant speed as was Rat 1. But instead of being 
removed from the situation Rat 2 was given an additional 10 trials. During these 
additional trials (Trials 31—40) Rat 2 showed no improvement whatsoever. Thus, 
after 40 trials Rat 2 was performing at the same level Rat 1 had reached after 30 
trials. The question is this: How much have the two rats learned? If we use 
running speed as an index of how much was learned, then we conclude that the 
two rats had learned the same amount and that Rat 2 had learned nothing during 
Trials 31-40. 

There is a problem with this conclusion. A rat can run just so fast. Once it has 
reached its top speed, it can go no faster, but it might still be learning. Rat 2 may 
be learning during Trials 31-40, but the increase in learning would not show up
in running speed because a ceiling had been reached. If we assume that some
learning might occur during Trials 31-40, it would be a mistake to use running
speed as our performance measure, because it would not be sensitive at all levels,
especially the higher levels.

One way of resolving this issue would be to utilize some other, more sensitive, 
performance measure, such as resistance to extinction. If we removed food from 
the situation and allowed Rat 1 and Rat 2 to extinguish, we might find that Rat 2
would keep running without food longer and faster than Rat 1 (see the bottom half
of Figure 5.1). We would then conclude that Rat 2 actually had learned more than
Rat 1, even though the difference did not appear when we used running speed as our
measure.

Figure 5.1 Hypothetical acquisition and extinction curves for two rats given different
numbers of acquisition trials.

The point is that, whenever we choose a performance measure, we must assure 
ourselves that it will accurately reflect learning at all levels, or at least at those 
levels likely to be involved in our study. If we do not, then we may be led to 
incorrect conclusions. 
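
The ceiling problem can be made concrete with a small sketch. Everything in it is an assumption chosen for illustration: the growth rule for the underlying "learning," the top speed, and the scaling between the two are invented and are not taken from Figure 5.1.

```python
def learning_strength(trials, rate=0.1):
    """Hypothetical underlying learning: each reinforced trial closes a
    fixed fraction of the remaining distance to a maximum of 1.0."""
    strength = 0.0
    for _ in range(trials):
        strength += rate * (1.0 - strength)
    return strength

def running_speed(strength, gain=100.0, top_speed=80.0):
    """Observed measure: grows with learning but is capped at top speed."""
    return min(gain * strength, top_speed)

rat1 = learning_strength(30)  # removed after 30 trials
rat2 = learning_strength(40)  # given 10 additional trials

print(rat2 > rat1)                                 # underlying learning differs
print(running_speed(rat1) == running_speed(rat2))  # measured speed does not
```

Under these assumptions both rats run at the capped speed, so running speed cannot distinguish them even though the hypothetical learning values differ.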


Averaging Data 

GROUP VERSUS INDIVIDUAL DATA. In a typical learning curve trials are plotted on 
the abscissa, and the units in which the behavior is measured are plotted on the 
ordinate. Given such a system it would be helpful if we could plot a single 
individual’s performance across trials and then make some sense of it, but most of 
the time we cannot. Individuals are variable in their behavior and follow a unique 
course through the learning process, and so we are often forced to average the 
scores from several individuals to obtain a clear picture of the stable and consistent
trends underlying the variable individual behaviors.


Figure 5.2 Some hypothetical paired-associate data, showing how the scores of four 
subjects are averaged. 


Figure 5.3 Blocking of data. The data from the upper graph are blocked and replotted in
the graph below. The data points in the lower graph are average numbers correct across
five adjacent trials in the upper graph. Notice how much easier it is to grasp the gradual
growth of the learning function in the blocked graph below, even though some detail is
lost. (These are hypothetical data.)


For example, the top four panels in Figure 5.2 contain hypothetical data from 4
different subjects as they learned a simple paired-associate list. As you can see,
their progress through the trials was extremely variable. Subject 1 began slowly
and finished with a bang. Subject 2 began rapidly and then leveled off. Subjects 3
and 4 provide equally unique records. If we were left with these four unique
records, we would not know what to say about how the learning of the list
progressed. So we combine the four records in an attempt to obtain a clearer
picture. The averages of the scores of the 4 individuals on each trial are depicted
at the bottom of Figure 5.2. Here we see something that begins to approximate
the familiar negatively accelerated learning curve. This trend, or function, was
obscured by the variability of the individual records. The assumption we must
make is that, if we could control all the unknown factors that yield high individual
variability, each individual curve would approximate the average curve.
We are, in a sense, assuming that the averaged data reveal the “real” learning
curve. Without averaging we might miss these stable trends, or relationships. We
would not see the forest for the trees.
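
The averaging of individual records into a group curve can be sketched in a few lines. The scores below are invented for illustration and are not the values plotted in Figure 5.2; the subject labels and the number of trials are likewise assumptions.

```python
from statistics import mean

# Hypothetical number correct on each of six trials for four subjects.
scores = {
    "Subject 1": [0, 0, 1, 3, 6, 8],  # begins slowly, finishes with a bang
    "Subject 2": [5, 8, 8, 8, 8, 8],  # begins rapidly, then levels off
    "Subject 3": [2, 5, 6, 8, 7, 8],
    "Subject 4": [1, 3, 7, 7, 8, 7],
}

def group_curve(records):
    """Average the scores of all subjects on each trial."""
    return [mean(trial_scores) for trial_scores in zip(*records.values())]

averaged = group_curve(scores)
print(averaged)
```

With these invented scores the gain from one trial to the next shrinks steadily in the averaged record, although no single subject's record shows that pattern.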

TYPES OF AVERAGING. There are a number of ways to average data, and each 
method has its advantages and disadvantages. Experimenters choose the one that 
best suits their needs and purposes. We have just discussed one of the most 
popular methods, wherein curves are smoothed by combining the scores from a 
number of individuals on each trial. In another method, scores obtained from a 
single individual over a series of runs through the task are combined. In still 
another technique, scores from adjacent trials are combined. For example, one 
might plot average performance in successive blocks of five trials, rather than 
performance on each trial. This technique loses detail, but it does make it easier to 
see general trends by eliminating minor peculiarities in the data (see Figure 5.3). 
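
Blocking adjacent trials amounts to a simple computation. The sketch below assumes non-overlapping blocks and drops any incomplete block at the end; the per-trial scores are invented and are not those of Figure 5.3.

```python
def block_means(scores, block_size=5):
    """Average adjacent trials in successive, non-overlapping blocks.
    A trailing block with fewer than block_size trials is ignored."""
    return [
        sum(scores[start:start + block_size]) / block_size
        for start in range(0, len(scores) - block_size + 1, block_size)
    ]

# Hypothetical number correct on each of 15 trials.
per_trial = [1, 3, 2, 4, 5, 4, 7, 5, 8, 6, 8, 7, 9, 8, 8]
blocked = block_means(per_trial)
print(blocked)
```

The trial-to-trial reversals in the raw scores disappear in the blocked values, at the cost of the detail within each block.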

Cumulative response curves are also popular. They are curves in which the
units of behavior, or scores, are allowed to accumulate over trials, or periods of
observation. For example, Figure 5.4 contains data obtained from a pigeon
pecking a key for food over a series of 4-minute intervals. Figure 5.4a contains a
cumulative record of the responses. The same data are presented in Figure 5.4b,
but in this case the average numbers of responses per minute are plotted against
successive time intervals.

Figure 5.4 Comparison between two procedures for presenting results. Panel (a) contains
a segment of a record obtained from a cumulative recorder. The paper is fed through the
recorder at a constant speed; each response moves the pen a fixed distance across the
paper. Thus, the rate of response is shown by the slope of the curve. The pen
automatically resets to the baseline at some preselected point (in this case, after 500
responses). The same responses were also indicated on electromechanical counters, and
those counts are plotted in the panel on the right. The results show the effects of training
a pigeon with different amounts of food per reinforcement. The bird was reinforced
occasionally (on the average, once per minute) for pecking an illuminated key. The color
of the key indicated which reinforcement condition was in effect. Reinforcements
consisted of delivery of either one, four, or eight hemp seeds. The reinforcement
condition changed every four minutes. Delivery of reinforcements is shown by a slight
oblique mark on the record. Among the advantages of the cumulative record are instant
information concerning the progress of the experiment and an indication of the
sequential changes in the behavior. The plot of responses trial-by-trial (or
minute-by-minute, in this case) allows the dependent variable to be read with greater
ease. In the first minute of this experiment, for example, it is clear from panel (b) that the
average response rate was approximately 14 pecks per minute, but it would be difficult
to determine this from the cumulative record. In practice, both cumulative recorders and
digital counters are usually used, since each form of recording has its own advantages.
(Adapted from Experimental Psychology, Third Edition, edited by J. W. Kling and Lorrin
A. Riggs. Copyright 1938, 1954, © 1971 by Holt, Rinehart and Winston, Inc. Reprinted
by permission of Holt, Rinehart and Winston, CBS College Publishing.)

At any given point on the cumulative curve we can read the total number of
responses made from the beginning of the experiment. If the bird stops pecking, 
the cumulative record levels off and remains horizontal until the bird begins 
pecking again. It can never go down. When the bird begins to respond again, the 
curve goes up as we add on new responses. The advantage of the cumulative curve
is that we can quickly and easily observe changes in the bird’s rate of responding.
The steeper the slope of the curve, the faster the bird is responding. The flatter the 
slope, the more slowly it is responding. 

The noncumulative method employed in Figure 5.4b also has its advantages. 
For example, we can quickly determine how many responses the bird produced 
during any given time interval. It is not so easy to obtain this information from 
the cumulative curve. Many experimenters try to use both methods of plotting 
their data, for each yields quick and convenient information about the subject’s 
performance. 
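
The two ways of plotting are two views of the same counts, as the following sketch suggests. The peck counts are invented (they are not the data of Figure 5.4), and the one-minute interval is an assumption made for the example.

```python
from itertools import accumulate

# Hypothetical pecks in six successive one-minute intervals.
pecks_per_minute = [14, 20, 0, 0, 35, 30]

# Cumulative record: running total of responses since the session began.
cumulative = list(accumulate(pecks_per_minute))

def per_interval(cumulative_counts):
    """Recover the noncumulative record by differencing the cumulative one."""
    previous = [0] + cumulative_counts[:-1]
    return [now - before for now, before in zip(cumulative_counts, previous)]

print(cumulative)
print(per_interval(cumulative))
```

The cumulative list stays flat across the two minutes without pecking and never decreases, while the per-interval list gives the count for any one minute directly.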

These methods and others (see, for example, Melton, 1936; Munn, 1950; 
Vincent, 1912) are all available to the psychologist. Psychologists as a whole, 
however, have never been able to decide which method is best. Some psychologists
have, in fact, gone so far as to say that group curves are next to worthless in
attempting to understand individual behavior (Sidman, 1952). Perhaps the best 
we can do at present is to be extremely careful in interpreting averaged data and 
to be aware of the differences among the alternative methods. We should be 
careful about putting too much weight on one method and should attempt to look 
at our data from several different angles. With this cautionary note in mind we 
now turn to a consideration of the two opposing theories of the role of rehearsal 
in learning. 


GRADUAL VERSUS ALL-OR-NONE LEARNING 

Hull’s Approach 

There are two reasons for discussing the theory of Clark L. Hull (1884-1952).
First, although few psychologists now accept the theory in its entirety, it has had a 
significant impact on the field of learning. Hull’s analyses and his modes of 
thinking are so ingrained in the field that it is difficult to discern what is, and what 
is not, the result of this man’s thought and experimentation. Whatever one 
concludes about the “correctness” of the theory, its heuristic value cannot be 
denied. It has provoked an enormous amount of thinking and stimulated an equal 
amount of research. 

The second reason for discussing Hull’s system is that it provides a good 
example of the incremental approach to the role of rehearsal in learning. Hull 
believed that learning increases gradually as a result of reinforced practice. 
Hence, we can both catch a glimpse of an enormously influential theory from the 
recent past and at the same time pursue our quest for an understanding of the role 
of rehearsal in learning. 

Hull was a behaviorist. Behaviorists are not particularly interested in what goes 
on “in the mind,” or in the content of consciousness. Internal events, such as 
thoughts, ideas, and images, are for the strict behaviorist unmeasurable and 
therefore to be avoided in a science of psychology. What interested Hull
was making predictions about behavior based on observable, measurable
qualities of the real world. Measurement and quantification of observables were 
the cornerstones of his approach. 

His system was one of the last “master theories.” Although many of its 
postulates were developed from animal data, the system was designed to account 
for the behavior of all species, in all situations. In other words, he wished to 
encompass all of behavior in one grand system. This hope is revealed in the titles
of his books: Principles of Behavior (1943), Essentials of Behavior (1951), and A
Behavior System (1952). 

Hull’s attempts to explain all behavior with a single system were not entirely 
satisfactory. It was just more than anyone could hope to do. Modern psychologists
limit themselves to “microtheories,” focusing on one specific task (for
example, serial learning) or one small aspect of behavior (such as recognition 
memory) and attempting to build a model of that limited bit of behavior. Few 
people would now tackle what Hull attempted. 

Hull’s work was complicated, mathematical, and constantly changing. He died 
soon after completing his last book, and in no way did he ever consider his system 
finished. For our purposes we shall consider the system in its simplest form, to 
avoid becoming entangled in its immense complexity. Good, detailed summaries 
are presented by Bower and Hilgard (1981) and Hintzman (1978). 

Once we strip away the great detail, the basic concerns underlying the theory 
are sound. Hull developed an equation for behavior, which can be expressed as 

Response probability = (D × H × K × V) − I

The equation implies that the probability of a response’s occurring is a function of 
the interaction of at least five major variables. The elaboration of the system is 
complicated, but the underlying ideas are not. If you give it a moment’s thought, 
you will probably be able to identify most, if not all, of Hull’s major variables. 
Imagine a rat faced with a lever in a Skinner box. Imagine a child in the kitchen 
faced with a refrigerator. The question is this: What factors will determine 
whether the rat presses the lever and whether the child opens the refrigerator 
door? In both these situations the reward for making the response in question 
(pressing the lever or opening the door) is food. So the hungrier the organism, the 
more motivated each will be to perform the required behavior. The first of Hull’s 
variables, D, or drive, is designed to encompass the influence of how motivated 
the subject is to perform. Drive level is measured in terms of time of deprivation 
or amount of noxious stimulation. The more deprived or the more noxiously 
stimulated the animal, the greater is its drive state and, hence, the greater is the 
tendency to act so as to reduce that drive state. Drive instigates action, and action 
seeks to reduce drive. Hull went through several steps in his thinking about drive, 
but for our purposes we can think of it as a motivational construct tied to, or 
measured in terms of, time of deprivation or amount of noxious stimulation. 

Another factor that might contribute to the rat’s or the child’s tendency to 
respond is how many times each has responded in the past and been reinforced 
for doing so. Experience in a given situation clearly affects the probability of a 
response’s occurring. That is what H, or habit, in Hull’s system is all about. Habit 
reflects how much the organism has already learned. If the child has opened the 
refrigerator door in the past and been reinforced for doing so, then he will be 
more likely to open it in the future. Hull measured H in terms of the number of 
reinforced responses. The more times the rat presses the bar and is reinforced, the 
greater is the habit strength. 

Two points should be made here. First, in Hull’s system reinforcement is 
produced by a rapid diminution of D. A response is reinforced if it is quickly 
followed by a reduction in drive. For Hull, learning, or an increase in H, is the 
result of a response’s being followed by drive reduction. The rat learns to press
the bar because it is reinforced for doing so. If it is not fed and D is not reduced,
learning will not occur. Hull’s drive-reduction interpretation of reinforcement is 
not the only way to think of reinforcement, as we shall see later, but it has been an 
influential conception. The second point is that D activates H; D multiplied 
by H yields a response probability. This conception corresponds to our earlier 
distinction between learning and performance. Performance (response probability)
refers to the activation of learning (H) through the involvement of motivation
(D).

To return to the thread of the discussion, we should ask what else might 
determine behavior, and the contents of the refrigerator seem likely to have some 
bearing. A chocolate cake might induce the child to respond, whereas something 
less palatable might be less likely to stimulate a response. Hull acknowledged the 
importance of these factors with his K construct. Hull’s K, or incentive motivation,
is measured in terms of the quantity of the goal (the more the better) and in
terms of the quality of the goal (cake might be better than turnips). The greater K
is, the greater is the probability of the response’s occurring. K is a “pull” factor;
the child is pulled toward the cake. Drive, or D, on the other hand, is a “push” 
factor; the child is driven toward the refrigerator by hunger. 

Thus, total motivation includes both D and K; D pushes while K pulls. The 
child is driven toward the refrigerator by hunger and simultaneously attracted by 
the knowledge that the cake is inside. H is Hull’s learning factor, and both D and 
K are motivational terms. 

The clarity of the stimulus situation might also help determine the behavior of 
our organisms. If the rat can barely see the lever, it will be less likely to respond to
it. If the lever is bright, distinct, and intense, then the rat will be more likely to
respond to it. It seems that this was what Hull was trying to account for when he 
postulated V, or stimulus intensity dynamism. Hull’s V expresses the idea that the
probability of a response will increase as the intensity of the stimulus increases. In other
words, we are more likely to respond to intense stimuli than to less intense ones. 

You will note that all four of Hull’s variables that we have discussed thus far 
contribute positively to the probability of a response’s occurring and that their 
relationship is multiplicative. Thus, if any one of them is zero, then response 
probability is zero. The fifth, and last, variable subtracts from response probability.
Hull calls this last factor I, or inhibition. In a very rough sense I is a fatigue
factor. It is measured in terms of the amount of work involved in making a 
response and in terms of the number of times the response has been made. The 
harder it is for the child to open the refrigerator door and the more times in a row 
he has done it, the less likely he is to do it again. 

Thus, there is nothing esoteric about Hull’s model. It is based on a series of 
intuitively obvious assumptions about what might contribute to behavior. It is a 
survival model in which contiguity, practice, and reinforcement are all important.
Learning occurs when a response is quickly followed by a drop in drive level. 
Learning occurs in a gradual, incremental fashion and not in an all-or-none 
manner. 
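
The simplified equation can be written out as a short function. This is only a sketch of the form given above: the function name, the 0-to-1 scales, and the example values are assumptions made for illustration, and nothing here reproduces Hull's actual units or his full system. Because the result is not guaranteed to lie between 0 and 1, the sketch calls it a response tendency and does not treat it as a true probability.

```python
def response_tendency(drive, habit, incentive, intensity, inhibition):
    """Simplified form used in the text: (D x H x K x V) - I.
    All arguments are on arbitrary, assumed scales."""
    return drive * habit * incentive * intensity - inhibition

# A hungry, partly trained animal facing a clear stimulus and a good reward.
trained = response_tendency(drive=1.0, habit=0.5, incentive=1.0,
                            intensity=1.0, inhibition=0.25)

# The same animal with no prior reinforced practice (H = 0).
untrained = response_tendency(drive=1.0, habit=0.0, incentive=1.0,
                              intensity=1.0, inhibition=0.0)

print(trained, untrained)
```

Because the four positive terms are multiplied, setting any one of them to zero removes the whole positive contribution, which is why the untrained case yields no response tendency however high the drive.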

ARE THINGS LEARNED ‘FOREVER’? Another interesting aspect of Hull’s conception
of H is that, once reinforced practice ceases, H does not dissipate, or fade
away. It remains constant for the life of the organism. Such a conception suggests
that learning is permanent. Once we learn something, it is “in storage” forever,
even though we may not always be able to retrieve it. Does this conception seem 
accurate? 

Much information is stored within us that we cannot retrieve at a particular 
moment. We have all had the experience, while taking an exam, of realizing that 
we know the answer to a question but being unable to come up with it. Many of 
us have had the embarrassing experience at a social gathering of stumbling over a 
familiar person’s name. But do these common experiences mean that all information
is stored permanently, even if it is occasionally beyond our ability to retrieve
it?

ELECTRICAL BRAIN STIMULATION. There are at least two areas of research that 
bear on this issue. First, Penfield (1969) reports work done with patients undergoing
brain surgery. While under local anesthesia the patients were conscious and
able to report their experiences. As part of the surgical procedures the patients’ 
exposed brains were lightly stimulated with an electrical current. As the electrode 
was touched to the surface of the cortex, the patients reported extremely vivid, 
distinct memories of past events. As long as the electrode was held in place, the 
memory persisted. As soon as the electrode was removed, the memory ceased. 
Some of the memories appear to have been accurate and to have represented 
events from the distant past. Many of the memories were of events that the 
patients had completely “forgotten” or at least had not thought of for long 
periods. The implication here is that memories are somehow stored in the brain 
permanently even though we cannot normally retrieve them. It took the electrical 
current to activate these long-dormant memories. 

Although Penfield’s work certainly suggests that many memories are stored and 
inaccessible, it does not prove that all memories are stored permanently. There 
may still be much information that is encoded but eventually lost. In addition, 
there is some question whether Penfield’s patients retrieved accurate memories of 
true events or merely experienced new sensations that resembled old events. 

HYPNOTIC AGE REGRESSION. Under hypnosis some patients can be encouraged 
to “go back” in their life, or to “become” younger and younger (McConnell, 
1974). Obliging subjects begin to act so as to suggest that they are becoming 
younger; they use baby talk and physically act like children. In addition, they 
appear to have memories of their childhood that are attainable only under the 
hypnotic trance. You can see that these results might be used to bolster the 
argument that memories are permanent. 

But there has been a great deal of criticism of the age-regression phenomenon. 
First, many investigators (Barber, 1970; Orne, 1970) argue that hypnosis itself is 
overrated. They argue that many subjects are “role playing,” or acting the way 
they think the hypnotist wants them to act. Nonhypnotized subjects can do many, 
if not all, of the highly publicized hypnotic tricks, such as suspending oneself 
between two chairs by the head and the heels. These investigators feel that 
subjects want to oblige the hypnotist and so go along with her or his suggestions. 
Second, many of the memories recalled by hypnotized subjects are no more 
accurate than the memories of nonhypnotized subjects (O’Connell, Shore, &
Orne, 1970). In other words, there is enough controversy surrounding the
age-regression phenomenon to suggest that we take it with a grain of salt.

Neither age regression nor Penfield’s work conclusively proves that memories 
are lifelong. Although Hull postulated that they were, his position has yet to be 
verified experimentally. 

We shall return to Hull’s system from time to time, as it has influenced many 
subareas of learning. For now we should keep firmly in mind the fact that it is an 
incremental theory in which learning is assumed to increase in a gradual fashion 
as reinforced practice progresses. To the student interested in the developments in 
the Hullian tradition, the work of Miller (1959), Mowrer (1960), and Spence 
(1956, 1960) is relevant. 

Guthrie’s and Estes’ Approaches 

We now turn to a consideration of the all-or-none approach. Although there have 
been many variations of the basic all-or-none position (see Millward, 1971), we 
shall limit our discussion to two of the more influential systems. First, Guthrie’s 
original interpretation of learning laid the groundwork for later, more formal, 
expressions of the all-or-none position. Second, Estes’ stimulus-sampling model 
represents one of these more recent mathematical statements of Guthrie’s earlier 
ideas. 

The very simplicity of Guthrie’s theory is intriguing. It can be summed up in 
one sentence: A combination of stimuli that has accompanied a movement will on 
its recurrence tend to be followed by that movement (Guthrie, 1935, 1952). If a 
response occurs in the presence of a stimulus, learning will be complete and 
automatic. No further practice is necessary, and no reinforcement is necessary. 

On the surface Guthrie’s statement seems contrary to the real world. It implies 
that learning occurs completely in one pairing of the stimulus and the response. 
Yet we know that in most situations (for example, a rat pressing a bar for food), 
the strength of a response seems to grow gradually as practice continues. We have 
already seen that S-R contiguity, though perhaps necessary, does not appear to be
sufficient for learning to occur. To resolve this apparent contradiction Guthrie 
distinguishes between acts and movements. Acts are the complex responses that 
we observe and study. Riding a bicycle, discovering a concept, saying a word, and 
typing a letter are acts. Each of these total acts is made up of many different 
movements. For example, a movement in bicycle riding might be pushing down 
with the left foot or grasping with the right hand. Each individual movement is 
learned in one trial, although mastery of the total set of movements may require 
many trials. To master the total complex act, many different responses must be 
connected to many different stimulus configurations. The gradual improvement 
with practice that we observe is the result of a growing number of learned 
movements, each of which is acquired in all-or-none fashion. 

Estes (1959) attempted to translate Guthrie’s basic notions into a more formal, 
mathematical expression. We present his stimulus-sampling model here in simplified
form. Students interested in the many versions and elaborations of statistical
learning theory are referred to Atkinson, Bower, and Crothers (1965),
Atkinson and Estes (1963), Bower and Hilgard (1981), Estes and Suppes (1959),
Kintsch (1977), and Millward (1971).

We can restate Guthrie’s original proposition in the following form:

1. If R₁ occurs in the presence of S₁, then R₁ will be connected to S₁ completely
and automatically.

2. If a different response (R₂) occurs in the presence of S₁, then R₂ will be
completely and automatically connected to S₁, and R₁ will be completely
disconnected from S₁. In other words, a stimulus can be connected to only
one response at a time, and it will be connected to the most recently
occurring response.

3. The probability of a response’s occurring equals the proportion of presented 
stimuli to which it is connected. 

Let us try to understand these statements. The critical point is that the probability
of a response’s occurring equals the proportion of the presented, or sampled,
stimuli to which it is connected. This implies that any given situation in which a 
response can occur probably consists of many stimuli, each of which may or may 
not be connected to the response. Imagine the total set of stimuli that can impinge 
on an animal in a learning situation (say, in a bar-press paradigm). This is not just 
the set of stimuli that are actually impinging on it at any given moment, but all the 
various stimuli that might possibly impinge on it at different times. These stimuli 
include all the sights, sounds, smells, odors, tastes, and so on that can act on the 
animal. No one really knows how many there are or, for that matter, exactly what 
they are. This total set of stimuli that must exist but cannot be measured is 
represented at the upper left of Figure 5.5. Imagine further that some of these 
many stimuli are already connected to the bar press. Next imagine a subset of 
stimuli that actually act on the animal at a given moment. This subset represents a 
sample drawn from the larger total set. That is, at any given moment the animal is 
facing in a particular direction, it is hearing certain sounds, smelling certain 
odors, and so on. The animal cannot experience the total set at any given 
moment, but it does experience a subset of the total set. 

Figure 5.5 The stimulus-sampling model: (o) stimulus connected to response; (•) stimulus
not connected to response.

Figure 5.5 contains a very stylized example of the way in which learning
progresses in the stimulus-sampling model. Focus on the first subset of four
stimuli drawn from the total set. Two of these stimuli have already been connected
to the response, and two have not. At this point the probability of a
response’s occurring due to learning is .50. (The qualifier “due to learning” is
important here, for the
response may actually occur with a much higher probability. For instance, in
classical conditioning the occurrence of the response is ensured on every trial by
the presence of the UCS. Thus, the .50 probability refers to the probability of the
response’s occurring due to learning and does not encompass the fact that the
UCS evokes the response on each trial.) If the response does occur, then the two
previously unconnected stimuli become completely connected. The four stimuli,
now all connected to the response, are returned to the total set. The proportion of
conditioned elements in the total set has now increased to .80. On the next trial a
second subset is sampled. Because all stimuli are equally likely to be sampled, the
proportion of connected stimuli in this second subset is likely to be higher than in
the preceding subset. Thus, the probability of the response’s occurring due to
learning is higher the second time around (.75 in Figure 5.5), and so on. On each
successive trial a subset of stimuli is sampled from the total set. If the response
occurs, then all the unconnected elements of the subset are connected to the
response, and the entire subset is returned to the total set. There is a gradual
growth in the proportion of connected stimuli on successive trials. The model,
although it considers learning to be an all-or-none event, predicts the incremental
result that we obtain in the real world. In fact, it predicts a negatively accelerated
performance curve, because the proportion of connections that can be established
on successive trials must eventually decrease as the supply of unconnected stimuli
is gradually exhausted.
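
How all-or-none connections can add up to a gradual curve may be easier to see in a small simulation. The sketch below is a simplification under stated assumptions: the size of the total set, the number of stimuli initially connected, the sample size, and the random seed are all invented, the response is assumed to occur on every trial (as when a UCS ensures it), and the possibility of a competing response disconnecting stimuli is left out.

```python
import random

def simulate_sampling(total=100, initially_connected=20, sample_size=10,
                      trials=30, seed=0):
    """Proportion of connected stimuli in the total set after each trial."""
    rng = random.Random(seed)
    connected = [i < initially_connected for i in range(total)]
    proportions = []
    for _ in range(trials):
        sample = rng.sample(range(total), sample_size)
        for i in sample:
            connected[i] = True  # all-or-none: connected in a single trial
        proportions.append(sum(connected) / total)
    return proportions

def expected_proportion(start, sampled_fraction, trials):
    """Average outcome: each trial connects, on average, the sampled
    fraction of whatever stimuli are still unconnected."""
    return 1 - (1 - start) * (1 - sampled_fraction) ** trials

print(simulate_sampling()[:5])
print([round(expected_proportion(0.2, 0.1, n), 3) for n in range(5)])
```

In the expected-value version the gain on each trial is a fixed fraction of what remains unconnected, so successive gains shrink, which is the negatively accelerated form described above.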


What, Actually, Is One Trial? 

There have been a number of attempts to pit the all-or-none and the gradual- 
growth positions against each other experimentally (Estes, 1960; Rock, 1957; 
Voeks, 1954). Unfortunately, these clever and interesting attempts have failed to 
provide conclusive support for either position. The all-or-none position has 
probably fared worse than the gradual-growth hypothesis (Battig, 1968; Jones,
1962; Postman, 1962a; Underwood & Keppel, 1962), but neither hypothesis has 
scored a clear victory. 

One of the problems inherent in many of these comparative experiments has to 
do with the nature of “the trial.” What is one trial? Is it one presentation of the 
stimulus and the response by the experimenter? Is it one subvocal rehearsal by the 
subject? Is it one, but only one, firing of some complex, unknown sequence of 
neural activities? These questions are seldom addressed in all-or-none research. 
For example, implicit in some comparison experiments is the assumption that one 
presentation of the materials represents, or corresponds to, a single occurrence of 
the response in the presence of the stimulus. And yet, if we look a little closer, it 
seems that these experiments must involve multiple repetitions, or rehearsals. If 
we present a stimulus and a response to a subject for 2 seconds, more than likely
she rehearses the pair as many times and in as many ways as she can before the
next pair appears. Humans are certainly capable of several such mental rehearsals 
in 2 seconds. So far, no one has been able to control the number of rehearsals 
perfectly. 


Multiple-Trace Versus Strength Models

Although research on the controversy over one-trial versus incremental learning
has died down lately, a newer theoretical distinction has been receiving increasing 
attention. Specifically, a number of investigators have been wondering whether 
the relationship between practice and learning is better thought of in terms of a 
multiple-trace conception or a simple strength interpretation (see Ghatala, Levin, 
Bell, Truman, & Lodico, 1978; Hintzman, 1976; Hintzman & Stern, 1978; 
Whitlow & Skaar, 1979). Basically, both approaches are concerned with the 
relationships between practice and learning. They differ in terms of the postulated 
effects of rehearsal. The multiple-trace position argues that each presentation of a 
stimulus is stored as a separate, distinct representation of that stimulus. Strength 
theory, on the other hand, maintains that each presentation of an item adds to the 
strength of one single representation of that item. 

Consider Figure 5.6. Assume for the moment that you are not familiar with the 
letter B and are trying to learn it. It is presented to you for rehearsal in a series of 
trials by the experimenter. According to the multiple-trace theory, each time you 
perceive B, it is somehow stored as a discrete representation. As trials progress, 
you add to your growing store of little, internal Bs. According to the simple 
strength theory, something quite different happens each time you perceive and 
rehearse B. Here the argument is that you have only one single internal
representation of B but that this single B grows in strength as trials progress. 

It should be made clear that neither of these theories argues that we actually 
have physical Bs stored somewhere within us. The business with all the little Bs or 
the one growing B is merely a convenient way to think about how our memory 
system operates. By adopting one or the other of these positions we are merely 
assuming that the memory system operates as if we had little Bs or one big B 
stored within us. 

Multiple-trace theory will appear in connection with a number of other issues 
within this text. For now, it is important to realize that these two conceptions of 
the role of practice in learning are currently under close scrutiny. 

Exactly how the distinction between multiple-trace and strength theories relates
to the issue of one-trial versus incremental learning, just discussed, is not 
entirely clear. In one sense it can be argued that the strength position resembles the 
incremental position more than it does the one-trial position, whereas the
multiple-trace theory resembles the one-trial position more than it does the incremental 
position. 

Specifically, the strength model argues that a single internal representation of 
the external object grows in strength with repeated exposures to that stimulus. 
The multiple-trace model, in contrast, contends that discrete, distinct
representations are added on each practice trial and that the overall strength of storage is, in 
some sense, reflected by the sum total of all these discrete representational 
elements. The more times an item is presented, the more traces of that item there 
will be in memory, and the more traces are stored, the more likely the item is to be 
found when we search our memory for it. 

Figure 5.6 Symbolic representations of the multiple-trace (a) and strength (b) models of 
learning and memory. The multiple-trace model argues that each presentation of an item 
is encoded in memory as a discrete, unique event. The strength hypothesis proposes that, 
each time an item is presented, the strength, or potency, of a single trace is increased. 
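
The contrast between the two models can be made concrete with a small sketch. The Python below is only an illustration under stated assumptions, not a model taken from the studies cited here: the class names, the unit strength increment, and the per-trace retrieval probability p_trace are all hypothetical choices made for the example.

```python
class MultipleTraceMemory:
    """Multiple-trace view: every presentation is stored as its own trace."""

    def __init__(self):
        self.traces = {}  # item -> list of trial numbers, one entry per trace

    def present(self, item, trial):
        # A new, discrete trace is added on each presentation.
        self.traces.setdefault(item, []).append(trial)

    def trace_count(self, item):
        return len(self.traces.get(item, []))

    def p_found(self, item, p_trace=0.2):
        # Assumption (not from the text): each trace is found independently
        # with probability p_trace, so more traces mean a higher chance
        # that at least one of them is found.
        return 1 - (1 - p_trace) ** self.trace_count(item)


class StrengthMemory:
    """Strength view: one representation per item that grows with practice."""

    def __init__(self):
        self.strength = {}  # item -> strength of its single representation

    def present(self, item, increment=1.0):
        # Assumption: each presentation adds a fixed increment.
        self.strength[item] = self.strength.get(item, 0.0) + increment


multi = MultipleTraceMemory()
single = StrengthMemory()
for trial in range(1, 4):  # three presentations of the letter B
    multi.present("B", trial)
    single.present("B")
```

After three presentations the multiple-trace store holds three separate entries for B, whereas the strength store holds a single entry whose value has grown. Either bookkeeping scheme can yield better performance with practice, which is why the two views must be separated by predictions on which they differ.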

The way to choose between these two interpretations is to find predictions that 
they make that differ. We can then test to see which theory makes the more 
accurate prediction. Although the issue is far from resolved, multiple-trace 
theories have been fairly accurate (see Hintzman & Block, 1971; Whitlow & 
Estes, 1979). 

Whatever the final outcome of the controversy concerning the multiple-trace 
versus strength models, you should realize that this distinction represents the 
ongoing concern psychologists have for the relationship between learning and 
practice. 


MASSED VERSUS DISTRIBUTED PRACTICE 
The study of massed practice (MP) versus distributed practice (DP) has a long 
history. In its simplest form the MP-DP controversy is this: If our practice trials 
are spaced, or separated in time, is learning more or less efficient than if the trials 
are bunched together? If we have 6 hours to study for three exams, should we give 
each topic 2 consecutive hours, or should we give each topic eight 15-minute 
segments separated by intervals during which we study one of the alternative 
topics? The most common hypothesis has been that DP will be superior to MP. 
Early experimental data bearing on this issue were extremely complicated and 
often contradictory (see Archer, 1954; Jung & Bailey, 1966; Kimble, 1949; 
Underwood, 1961). But more recent investigations have begun to yield consistent 
MP-DP effects. Melton (1970) points out that these new and significant effects 
began to appear as new verbal tasks were developed. For example, significant 
MP-DP effects have appeared in free-recall learning situations (Madigan, 1969; 
Underwood, 1970; Waugh, 1970). You will recall that items in this type of 
experiment are presented randomly, and the subject is asked to recall them in any 
order. Although there have been exceptions (Waugh, 1967), the general finding is 
that recall of items presented twice is better if the two presentations are separated 
by one or more intervening items. DP seems to be superior to MP. In general, 
spaced presentation seems to lead to superior retention in free-recall situations 
(Hintzman, Block, & Summers, 1973; Shaughnessy, 1976). 

Melton (1970) has labeled this phenomenon the lag effect. As can be seen in 
Figure 5.7, the effect refers to the fact that in free recall the probability of recalling 
a repeated item is positively related to the number of intervening items. The 
farther apart the two presentations of the repeated item, the better is the recall of 
that item. In other words, distributed practice is better than massed practice. 
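
The lag of a repeated item is simply the number of other items that come between its two presentations. As a minimal sketch (the function name and the word list are made up for illustration and do not come from Melton's materials), the lag of each repeated item in a study list could be computed like this:

```python
def lags_of_repeated_items(sequence):
    """Return {item: lag} for items presented more than once.

    The lag is the number of intervening items between the first and
    second presentation; a lag of 0 means the repetition was massed.
    """
    first_seen = {}  # item -> position of its first presentation
    lags = {}        # item -> lag between first and second presentation
    for position, item in enumerate(sequence):
        if item not in first_seen:
            first_seen[item] = position
        elif item not in lags:
            lags[item] = position - first_seen[item] - 1
    return lags


# Hypothetical study list: "dog" is repeated at once, "tree" after two items.
study_list = ["dog", "dog", "tree", "lamp", "cup", "tree", "book"]
lags = lags_of_repeated_items(study_list)
```

In this list dog is a massed repetition (lag 0) and tree is a distributed one (lag 2). The lag effect is the finding that recall tends to improve as this number grows.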

Contextual Interpretation 

Although there have been other interpretations of the MP-DP lag effect (for 
example, Elmes, Dye, & Herdian, 1983; Glenberg, 1977, 1979; Greeno, 1970), 
one of the more popular has to do with subjects’ ability to code an item in a 
context of other items (D’Agostino & DeRemer, 1973; Elmes, Greener, & Wilkinson,
1972; Gartman & Johnson, 1972). If the two presentations of the critical 
item are separated by other items, then each presentation of the critical item 
occurs in a unique verbal context and is surrounded by a variety of different 
items. The subjects can associate the critical item with these different contexts, 
thereby enriching their coding of the critical item. Massed presentations, in 
contrast, represent a reduction in the richness and variety of the context in which 
the critical item is to be learned. Both presentations occur in roughly the same 
context. 


Encoding-Variability Interpretation 

Encoding variability (see Martin, 1971) refers to the idea that any given stimulus 
is composed of many different attributes, or components. Thus, the stimulus dog 
is composed of three different letters, a sound, three letter positions, and so on. At 
any given time a response may be connected to part of the overall stimulus but not 
necessarily to the entire stimulus. If you wanted to associate doll with dog, you 
might pay attention to the letter d in dog and not to the o or the g. Doll could 
also be associated with dog in any number of other ways, however. That is, the 
encoding can be variable. The next point to consider is that the more diverse and 
rich the encoding, the stronger the association will be. If doll can be connected 
with dog in a number of different ways (for example, using one letter at a time 
and using the whole word dog), then the association between dog and doll will be 
strong. The richer and more elaborate the coding (that is, the more different 
ways doll is associated with dog), the better is the association. 

What has all this to do with massed and distributed practice? The argument is 
that distributed practice leads to richer and more diverse, more variable, coding. 
When a given item (say, two paired nouns) is presented twice close together in 
time, the subject is likely to use the same code each time (for example, “Doll goes 
with d”). But if the item is distributed in time, the subject is more likely to use two 
different codes on the two presentations (“Doll goes with d” on the first presentation
and “Doll goes with the word ending in g” the second time the pair is 
presented). 
Figure 5.7 Probability of recall of words that occur once or twice, with varying numbers 
of other words occurring between the two presentations of a given word. Presentation 
rates: (○) 4.3 seconds; (□) 2.3 seconds; (△) 1.3 seconds. (Adapted from Melton, A. W. 
The situation with respect to the spacing of repetitions and memory. Journal of Verbal 
Learning and Verbal Behavior, 1970, 9, 596-606. Fig. 1, p. 602.) 


In other words, encoding is likely to be more variable under distributed 
conditions than under massed conditions and thus is likely to be stronger. The 
encoding-variability interpretation of the superiority of distributed practice is 
similar to the contextual interpretation just discussed. The difference between the 
two is that the contextual interpretation focuses on the variability of the
contextual stimuli (what surrounds the item), whereas the encoding-variability
hypothesis emphasizes variability within the item itself. Both factors may well contribute 
to overall MP-DP effects. 

Students often ask whether the MP-DP effect means that we should distribute 
our practice, say, when we study for exams. It is probably too early to be definite 
about this sort of thing. On the one hand, although DP has been shown to be 
superior to MP, one must remember that these effects have been obtained with 
very limited kinds of task in very limited experimental situations. Whether the 
effects can be generalized to situations outside the laboratory remains to be seen. 
On the other hand, it is also incorrect to say that we can forget about patterns of 
rehearsal because they are unimportant. An earlier hypothesis, called the total 
time law (Bugelski, 1962), referred to the idea that the amount of learning that 
will occur in a given interval is constant, regardless of how that time is spent in 
rehearsing. In other words, if we study a list of 10 words for 1 minute, we will 
learn the same amount, regardless of how we spend the study time. It should not 
make much difference whether we go over the list six times, spending 1 second on 
each item each time through the list, or go through the list only once, spending 6 
massed seconds on each item. Clearly, significant MP-DP effects deny the validity 
of this total time law; patterns of practice do make a difference. 
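
The arithmetic behind this example is worth spelling out, because the total time law rests on it. In the sketch below (the function name is a hypothetical helper, not part of Bugelski's formulation), both study schedules for the 10-word list add up to the same 60 seconds:

```python
def total_study_seconds(n_items, passes, seconds_per_item):
    """Total study time for a list studied in repeated passes."""
    return n_items * passes * seconds_per_item


# Six passes through the 10-word list at 1 second per item.
spaced = total_study_seconds(n_items=10, passes=6, seconds_per_item=1)

# One pass through the same list at 6 massed seconds per item.
massed = total_study_seconds(n_items=10, passes=1, seconds_per_item=6)
```

Because the two totals are equal, the total time law predicts equal learning under the two schedules. A reliable difference in recall between them therefore counts against the law.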

LEVELS OF PROCESSING 

Thus far we have discussed the controversy over one-trial versus incremental 
learning and several issues relating to patterns of practice. In all these areas we 
have been concerned with the quantity of rehearsal and the patterning of that 
rehearsal. Recent years have witnessed a new emphasis, or a shift of concern, in 
connection with the relationship between practice and learning. Specifically, there 
has been growing interest in the quality, or type, of rehearsal. Now psychologists 
are interested not only in how much we rehearse but also in what kind of practice 
we engage in. As we shall see, both the durability of learning and its ultimate 
usefulness depend heavily on the kind of rehearsing we do. 

The new emphasis on the quality rather than the quantity of rehearsal began 
with, among other things, the introduction of the concept of levels of processing, 
developed by Craik and Lockhart (1972). Essentially, the phrase refers to the idea 
that the durability of learning depends on how we rehearse. More specifically, the 
“deeper” our rehearsal, the better and more durable is the learning. Craik and 
Lockhart suggest that we can rehearse materials (for example, a list of words) 
merely by attending to the physical nature of the stimulus (for example, what it 
looks like). This is considered to be relatively “shallow” processing and results in 
relatively weak learning. Or we can rehearse materials by attending to their sound 
(a little deeper form of processing). Finally, we can process or rehearse words on a 
semantic level (that is, having to do with the meaning of the word). Semantic 
processing represents “deep” processing and results in durable traces, or durable 
learning. The deeper the level of processing, the more durable is the trace. 

Support for the Concept 

An experiment by Craik and Tulving (1975) will serve as a good example of the 
type of experimental design most often used in the study of levels of processing. 
Subjects were shown a long series of words for very brief periods. These words 
were flashed on a screen, one at a time, for less than 1 second each. In some 
instances subjects had to judge whether the word flashed on the screen was typed 
in capitals. This is a form of shallow processing in which the subject’s attention is 
drawn to the physical, structural properties of the stimulus. In a second kind of 
condition the subjects had to judge whether the flashed stimulus word rhymed 
with another designated word. This kind of phonemic, acoustical processing 
represents an intermediate level of processing. Finally, in the condition involving 
the deepest level of processing the subjects had to decide whether the flashed word 
fit into an empty space in a sentence. This semantic level of processing involves 
the meaning of the word. Then, in a surprise test the subjects were asked to recall 
all the words. As expected, in agreement with the levels-of-processing hypothesis 
that deeper processing will lead to more durable learning, items in the semantic 
condition were recalled more accurately than items in the phonemic condition, 
which, in turn, were recalled more accurately than the items in the structural 
condition. Recall increased as depth of processing increased. 


Table 5.1 Levels of Processing Tasks 

                              LEVEL OF PROCESSING 

SHALLOW                       INTERMEDIATE                  DEEP 
(STRUCTURAL)                  (ACOUSTIC)                    (SEMANTIC) 

Does this word begin with     Does this word have an “r”    Can you hold this in 
a capital?                    sound in it?                  your hand? 

Do these two words begin      Does this word rhyme with     How familiar is this? 
with the same letter?         this other word? 

How many syllables does       Where does the accent fall    Is this a word? 
this word have?               in this word? 

Is this word typed in lower   Does this word end in an      How pleasant is this? 
case letters?                 “est” sound? 

Adapted from Houston, J. P., Bee, H., and Rimm, D. C. Invitation to psychology. New York: Academic Press, 1983. Fig. 6, p. 236. 


This experimental design is widely used. It really involves incidental learning, 
because the subjects are never actually instructed to learn the words, only to 
orient toward them and to make some judgment about them. The recall test is 
always a surprise. 

The fact that level of processing relates to durability of trace in many populations
of humans has been well documented (Bellezza, Cheesman, & Reddy, 1977; 
Cermak & Reale, 1978; Craik & Tulving, 1975; McDowall, 1979; Moeser, 
1983). As it turns out, there are many different ways to rehearse materials. Some 
are more efficient than others, and some are more suited to a particular task than 
others. For example, rehearsing words by inventing mental images of them may 
be particularly helpful in one situation, whereas rehearsing the same words in 
terms of their verbal meanings may be more helpful in another situation (see 
Peterson, Thomas, & Johnson, 1977). 

Some of the types of task that have been used to produce different levels of 
processing are contained in Table 5.1. Notice in the last column of the table that 
rating the pleasantness of a word is categorized as an instance of deep processing. 
Packman and Battig (1978) and others have found that rehearsing or processing 
words by rating them in terms of pleasantness actually does seem to be a good 
way to ensure recall of those words. They gave subjects lists of words and asked 
them to rate the words on various dimensions. Some words were rated in terms of 
how pleasant they were, whereas others were rated in terms of such factors as 
familiarity, meaningfulness, imagery, concreteness, and number of attributes. 
Uniformly, the words rated in terms of pleasantness appeared to be recalled best 
at a later time, whereas the remaining modes of processing did not appear to 
differ in terms of how well words were recalled. 

Level of processing has been related to many other areas of investigation. For 
example, Smith, Theodor, and Franklin (1983) showed that the priming effect can 
be influenced by depth of processing. You will recall from Chapter 1 that the 
priming effect refers to the fact that, if one word is shown for a split second and 
then the subject is asked to make some decision about a second word, the 
decisions about that second word will be faster and more accurate if the first word 
(the prime) is related to the second word. Smith, Theodor, and Franklin took the 
investigation one step further and varied the depth of processing of the prime. 
They found that, the deeper the prime was processed, the faster and more 
accurately decisions were made about the second word. In order of increasing 
depth, the levels of processing they used were these: 

1. Respond if a star is next to the prime. 

2. Respond if a certain letter is in the prime. 

3. Respond if the prime has more than one syllable. 

4. Read the prime. 

5. Respond if the prime is a living thing. 

Figure 5.8 Number of words recalled as a function of level of processing. (From 
Cognition by Margaret Matlin. Copyright © 1983 by CBS College Publishing. Reprinted 
by permission of Holt, Rinehart and Winston, CBS College Publishing.) 

In another interesting study Rogers, Kuiper, and Kirker (1977) propose a level 
of processing that may even be deeper than semantic processing. They call this 
deepest of all processing tasks self-reference. In this kind of task the subject is 
asked things such as “Does this word describe you?” and “Have you ever done 
this?” As you can see in Figure 5.8, when it was compared with the usual 
processing procedures, self-reference was the most effective. Apparently, if we 
relate something to our own wonderful selves, we really remember it well. 

Criticism of the Concept 

The levels-of-processing approach has been heavily criticized on a number of 
grounds. First, it has been characterized as being too vague and ill-defined to be 
accepted as a substantial, testable theory. Second, the definition of depth of 
processing is somewhat circular (Baddeley, 1978). It is said that deeply processed 
words are the ones that are remembered and that remembered words must have 
been deeply processed. The problem is that there is no independent measure of 
depth of processing currently available (Nelson, 1977). Third, there are important 
exceptions to the rule that deeper processing leads to more durable traces. For 
example, the levels-of-processing approach argues that mere rote repetition of 
material at a given level will not help us to remember it; we must process at a 
deeper level if we wish to improve the durability of the trace. But Nelson (1977) 
reports data suggesting that rote rehearsal does, after all, help us remember things 
and that it is not absolutely essential to increase the depth of processing. He had 
subjects process at a phonemic level (“Does the word contain an ‘r’ sound?”) or a 
semantic level (“Does the item represent a living thing?”). In addition, subjects 
were given either one or two repetitions of the entire list of 30 words. Finally, they 
were tested for retention with uncued (unprompted) recall, cued (prompted) 
recall, or recognition. As you can see in Figure 5.9, regardless of type of memory test 
or level of processing, repetition did help retention. This is contrary to the 
levels-of-processing approach, which argues that mere repetition will not increase 
retention as long as the depth of processing remains constant. Similarly, Kolers 
(1976) had subjects read some material upside down. One year later they were 
still able to recall some of this material. The argument (with which you may
disagree) is that the processing that goes on when you read something upside down 
is relatively shallow. (You are just trying to figure out what those inverted words 
and letters are, never mind what they relate to.) And yet this information, processed 
on such a shallow level, was recalled a year later. In other words, these data 
suggest to some that deep processing is not necessary for the formation of durable 
traces. 

Figure 5.9 Percentage correct on three memory tests as a joint function of number of 
repetitions (one versus two) and depth of processing (semantic versus phonemic). 
(Adapted from Nelson, T. O. Repetition and depth of processing. Journal of Verbal 
Learning and Verbal Behavior, 1977, 16, 151-171.) 

On the basis of this sort of evidence, Baddeley (1978) concludes that deeper 
processing can lead to more lasting learning but that it is not always essential. 
Still, even though it may not always be necessary, learning often appears to 
become more durable as the level of processing becomes richer and deeper. 

Two Hypotheses: Elaboration and Distinctiveness 
Walker and Jones (1983) and others have tried to explain exactly why it is that 
deeper processing leads to better retention. Two hypotheses have been proposed. 
The first of these, the elaboration hypothesis, maintains that semantic processing 
leads to richer, more varied encoding. When you process the word cat semantically
(in terms of its meaning), you are more likely to store more information and 
more elaborate information than you are if you try to process it structurally (in 
terms of what the word looks like). The idea is that complex, varied, substantial 
information stimulated and stored by semantic processing will lead to better 
retention than will the skimpy information stimulated by shallower processing. 

The second hypothesis, called the distinctiveness hypothesis, holds that deep, 
or semantic, processing leads to better retention because it tends to help make the 
stimulus more distinct from other information in storage. Cat is more likely to 
become distinct from other stimulus words if it is processed in terms of its 
meaning rather than in terms of its simple physical structure. Perhaps deeper 
processing will eventually be shown to increase both elaboration and
distinctiveness. 

These and other conceptual advances are helping to overcome some of the 
difficulties faced by the levels-of-processing approach. However, skepticism
remains high concerning its eventual fate. We shall look at this approach in more 
detail in Chapter 11 when we address and contrast current models of memory. 
Whatever the final outcome with respect to the levels-of-processing analysis, 
Craik and his associates have done the field a service by underscoring the fact that 
how we rehearse is as important as how much we rehearse. Still other areas of 
research and thought make this same point, as we shall now see. 

TYPE I AND TYPE II REHEARSAL 

In the context of this concern for levels of processing a number of authors have 
proposed two categories of rehearsal activity (Craik & Lockhart, 1972; Glenberg, 
Smith, & Green, 1977; Rundus, 1977; Woodward, Bjork, & Jongeward, 1973). 
Type I rehearsal, or maintenance rehearsal, is designed to hold information in 
mind, to keep it ready, but not to process it so deeply that it becomes part of our 
permanent store of information. Here we are concerned with shallow rehearsal, 
such as the verbal practice of a telephone number so that we do not forget it while 
we search for a pencil and paper. Type II rehearsal, or elaborative rehearsal, is a 
deeper process in which we try to prepare materials for future use. Here we might 
try to practice the telephone number in such a way that we can forget about it 
now and yet still be able to recall it later. To do this we might have to process at a 
deeper level. We might make up a story using the numbers involved or scan the 
number for associations such as dates or ages that are meaningful to us. In a 
sense, Type II rehearsal is whatever we do to transfer memories from temporary 
to permanent storage. 

Although intriguing, the distinction between Type I and Type II rehearsal is not 
entirely clear cut. For example, Nairne (1983) found that, contrary to expectations, 
Type I rehearsal does sometimes lead to more permanent memory. He presented 
digits to subjects and told them that they would be tested on them at the end of a 
retention interval. The subjects were told that the filler task (rehearsing paired 
associates) was unimportant and was merely required to prevent them from 
rehearsing the digits, which were supposed to be the critical items. Then, in a 
surprise, Nairne tested for recall of the paired associates. He found that the 
subjects could recall some of them. This wasn’t supposed to happen; Type I 
rehearsal is not supposed to lead to permanent storage. So we must be cautious 
about the distinction between Type I and Type II rehearsal. 

Figure 5.10 The task is to label the pictures as quickly as you can. (Adapted from 
Rosinski, R. R., Golinkoff, R. M., and Kukish, K. S. Automatic semantic processing in a 
picture-word interference task. Child Development, 1975, 46, 247-253.) 

CONTROLLED VERSUS AUTOMATIC PROCESSING 
Another interesting distinction being made in the literature is between controlled 
and automatic processing, or between processing that requires mental effort and 
processing that requires no conscious effort (Posner & Snyder, 1975; Schneider & 
Shiffrin, 1977). 

Controlled processes are seen as requiring mental effort, being done one at a 
time, and being limited by our available mental capacity. Automatic processing, 
in contrast, requires no effort, can involve parallel processing of more than one 
thing at a time, and is not limited in terms of capacity. Controlled processing is 
seen as being slower than automatic processing. 

Generally speaking, anything we do consciously of our own free will involves 
controlled processing, and anything we do without being aware of it involves 
automatic processing. An example should help make clear the distinction between 
these two kinds of processing. You can get a feel for the difference by first labeling 
the objects shown in Figure 5.10a. That’s easy, isn’t it? Now label the objects 
presented in Figure 5.10b. You will probably have trouble here, because it is 
difficult to ignore the incorrect words printed on the pictures (Rosinski, Golinkoff,
& Kukish, 1975). The idea here is that the verbal label “looks up” its visual 
representation automatically whether you want it to or not (Wood, 1983). Controlled
processing (naming the drawings) is pitted against automatic processing 
(thinking of the representation of the word) (see also Reiner & Morrison, 1983). 
The verbal label is more likely to interfere with the naming of the visual object 
than vice versa, because we can read words faster than we can label pictures. You 
can demonstrate this by reading the words in Figure 5.10b. You shouldn’t have 
much trouble. 

Table 5.2 An Example of Each Elaborator Type Reported for the Paired Associate 
Doctor-Stone 

ELABORATOR TYPE       ELABORATORS 

Related to both       gallstone 
Stimulus related      Mr. 
Response related      Flintstones 
Idiosyncratic         movie 

Adapted from Wang, A. Y. Individual differences in learning speed. Journal of Experimental Psychology, 1983, 9, 300-311. 
Table 1, p. 302. 

We shall return to the distinction between controlled and automatic processing 
several times, especially in our discussion of memory. For now it is enough to 
realize that we sometimes process enormous amounts of information without 
being aware of it and that mental effort is sometimes required to get the job done. 

ELABORATIVE REHEARSAL 

Our discussion of rehearsal has led us to the inescapable fact that, when we 
rehearse, we don’t merely repeat a response over and over. Although we can, if we 
wish, simply repeat an item, we are much more likely to organize, interpret, 
relate, catalog, and generally elaborate when we rehearse. We are active, creative 
rehearsers; we try to enhance learning by tailoring our rehearsal methods to suit 
the task at hand. 

Elaborative rehearsal is such an important process and covers so many topics 
that an entire chapter (Chapter 11) is devoted to it. For now, two examples of the 
many kinds of elaborative rehearsal that have been identified should suffice. 

Let’s begin with an example of elaborative rehearsal in paired-associate learning.
It is logically possible that we learn a paired associate such as doctor-stone 
merely by repeating the elements over and over. But that isn’t what actually 
happens. To the contrary, we usually learn paired associates through the use of 
many different elaborative strategies. We can relate the stimulus to the response, 
we can make up an image, or a “mental picture,” involving the two units, and we 
can make up a sentence, story, or rhyme using the two elements. 

Wang (1983) had subjects learn paired associates and then asked them how 
they had learned the items. Table 5.2 contains some of the types of elaboration 
actually used by the subjects. As you can see, they sometimes produced elaborators
that helped them learn the item doctor-stone by linking the stimulus and the 
response (gallstone). Sometimes they elaborated on the stimulus alone (Mr.), 
sometimes on the response (Flintstones), and sometimes in an idiosyncratic, or 
uniquely individual, fashion (movie). 

Table 5.3 Arrangement of Stimulus Materials Into Two Organizational Structures 

TAXONOMIC ORGANIZATION        SCHEMATIC ORGANIZATION 

Food                          Going skiing 
  eat pineapple                 go to mountains 
  eat peanuts                   put on down jacket 
  eat birthday cake             buy lift ticket 
  drink hot chocolate           ski down slopes 
  drink champagne               drink hot chocolate 

Clothing                      Going to a ballet 
  put on evening clothes        buy opera glasses 
  put on team cap               put on evening clothes 
  put on paper hat              go to theater 
  put on bathing suit           watch ballet 
  put on down jacket            drink champagne 

Places                        Going to a party 
  go to mountains               buy present 
  go to Hawaii                  go to party 
  go to theater                 put on paper hat 
  go to party                   eat birthday cake 
  go to stadium                 play charades 

Activities                    Going to a baseball game 
  play charades                 put on team cap 
  watch ballet                  go to stadium 
  ski down slopes               buy admission ticket 
  watch baseball game           watch baseball game 
  swim in ocean                 eat peanuts 

Things one would buy          Going to Hawaii 
  buy admission ticket          buy plane ticket 
  buy lift ticket               go to Hawaii 
  buy plane ticket              eat pineapple 
  buy opera glasses             put on bathing suit 
  buy present                   swim in ocean 

Adapted from Rabinowitz, M., and Mandler, J. M. Organization and information retrieval. Journal of Experimental Psychology, 
1983, 9, 430-439. Table 1, p. 432. 

Wang found that learning was best accomplished when the elaboration involved
both the stimulus and the response. He also found that fast learners do 
more elaborating than slow learners. And finally, he found that, if he took the 
elaborations developed by fast and slow learners and gave them to new subjects, 
the new subjects receiving the fast learners’ elaborations did better than subjects 
receiving elaborations from slow learners. In other words, rehearsal involving 
more elaboration and more complex elaboration seems best. 

The above example indicates one way in which we can elaborate within a single 
to-be-learned unit. But elaboration can help us when we are trying to learn an 
entire list as well. Rabinowitz and Mandler (1983) gave two groups of subjects 
the same list of items but organized the items differently in each condition. As you 
can see in Table 5.3, half the time the phrases were put into what the authors call 
a taxonomic organization, and half the time the exact same phrases were presented
to the subjects in a schematic organization. Which organization do you feel
would lead to better learning? Their results indicate that the schematic organization
led to significantly better learning of the phrases. The authors speculate that
this superiority may be due to the fact that items presented schematically are more
easily used in making up a single mental image, or scene, than those presented
taxonomically. 

As we shall see, these two examples of elaborative rehearsal do nothing more 
than scratch the surface of the kinds of elaboration that will be discussed in 
Chapter 11. The point to remember here is that rehearsal is turning out to be a 
multifaceted concept; it involves much more than simply repeating a response 
over and over. When we rehearse, we often systematize, relate, order, select, and 
transform information in our effort to learn. We are active rather than passive 
rehearsers. 


SUMMARY 

1. One of the oldest and most widely held assumptions in the field of learning is 
that temporal contiguity of the stimulus and the response is necessary for 
learning to occur. 

2. Some psychologists feel that S-R contiguity is sufficient for learning to occur,
whereas others think it is necessary but not sufficient. Although the contiguity
principle is an assumption of most existing theories of learning, its validity
has not been well documented. 

3. The strength of an S-R association is revealed by the likelihood of a particular
response’s occurring in the presence of a particular stimulus situation.

4. A stimulus can be defined as any detectable energy change. Offset and onset 
of such things as lights, sounds, odors, and tastes can serve as stimuli. 
Relationships and responses can serve as stimuli. Some energy changes are 
more effective as stimuli than others, particularly across species. An energy 
change must be attended to before it can function as an effective stimulus. 

5. The notion of a stimulus trace refers to the fact that the continuation of a 
stimulus is not dependent on continued stimulation of sensory receptors. 
Once an external stimulus has activated sensory receptors, the message is 
“taken into” the organism and maintained. Afterimages represent stimulus 
traces of an essentially sensory nature. Memories represent traces of a more 
complicated, “cognitive” nature. 

6. A response can be defined as any objectively identifiable segment of the 
continuous behavior process. 

7. In subjective organization experiments subjects are presented with a list of 
randomly arranged, unrelated words and asked to recall them in any order 
they wish. The list is presented again, but in a new random order. The 
subjects again try to recall them, and so on. Subjective organization refers to 
the fact that, as the experiment progresses, the subjects begin to recall the 
words in the same order, even though the words are presented in a new 
random order on each trial. This would seem to suggest noncontiguous 
learning. But additional research has revealed that the subjects may either (1) 
bring the elements together “in their own mind” or (2) rehearse them 
contiguously during the recall phase. 

8. Studies in the area of paired-associate learning also support the contiguity 
principle. If the time between the stimulus and response items is increased 
and the subjects are prevented from rehearsing or carrying a trace of the 
stimulus forward in time, then learning decreases.

9. Research reported in an earlier chapter indicated that the best CS-UCS
interval in classical conditioning is often 0.5 second, and this supports the
contiguity principle. Yet bait-shyness studies reveal that conditioning can
occur with CS-UCS intervals of several hours. Rats that have been X-irradiated
several hours after ingesting distinctly flavored solutions will still
develop an aversion to that flavor. Some feel that this demonstrates noncontiguous
learning. Others invoke the trace-conditioning notion to explain
away the challenge. At present it must be admitted that the possibility of 
stimulus traces being carried forward in these experiments has not been 
effectively eliminated. 

10. Although the bait-shyness effect is provocative, noncontiguous learning has 
not been demonstrated clearly and conclusively. The constant possibility of 
confounding through the action of stimulus traces has thus far impeded our 
quest for learning in the absence of S-R contiguity.

11. The literature on the sufficiency of S-R contiguity was also reviewed.
Kamin’s blocking effect in classical conditioning suggests that simple S-R 
contiguity is not sufficient for learning to occur. 

12. Recent informational interpretations of classical conditioning and their supporting
experiments also argue against the sufficiency of S-R contiguity.

13. Studies of short-term memory indicate that materials presented contiguously 
but incidentally (without the subject’s intending to learn them) will not be 
learned. 

14. Our conclusion is that S-R contiguity may well be necessary but that it is not
a sufficient condition for learning. 

15. The negatively accelerated learning curve is common, but it is not the only 
curve that appears in our research. 

16. So-called learning curves are actually performance curves, and their form is 
determined by both learning and motivation. 

17. We try to infer the amount of learning from a wide range of performance
measures, including such factors as running speed, response latency, resistance
to extinction, number of correct responses, and trials to criterion.

18. When choosing a performance measure we must be sure it is (1) reliable and 
(2) appropriately sensitive. 

19. Because individual performance data are often so variable, we are forced to 
average the scores to obtain a clear picture of the stable trends underlying 
variable individual behaviors. 

20. Depending on our purposes, we can “smooth” out our curves by combining 
scores from a number of individuals on each trial, by combining scores from 
adjacent trials, or by combining scores obtained from a single individual run 
through the task several times. Cumulative learning curves are popular ways 
of displaying data. 

21. Although learning curves present data in a clear, simple fashion, no one curve 
can tell the whole story. A given curve merely represents one vantage point 
from which we can view our data. Averaging techniques can distort our data. 

22. The incremental and all-or-none conceptions are two theories of learning. 
Hull’s incremental system was designed to explain all behaviors of all species. 
Although the theory has been extremely influential, few psychologists have
been willing to accept the specificity with which Hull wished to quantify the
conditions of behavior. 

23. Stripped of its detail Hull’s system assumes behavior to be a function of at 
least five major variables. Drive, a motivational factor, is measured in terms 
of time of deprivation or extent of noxious stimulation. Habit refers to how 
much the organism has learned about a particular situation through reinforced
responding. Incentive motivation is measured in terms of the quantity
and quality of the reward. As stimulus intensity increases, the probability of
the response increases. Inhibition, which detracts from the tendency to
respond, is measured in terms of how much work is involved in making the 
response and how many times the animal has made the response. 

24. Reinforcement, in Hull’s system, refers to the rapid diminution of drive. 
Habit is a negatively accelerated function of the number of reinforced responses.

25. Hull’s theory suggests that learning, or H, is permanent and stays with the 
organism until death. Studies of electrical brain stimulation and hypnotic age 
regression suggest that many memories normally unavailable to us are still 
stored within us. Neither form of investigation proves conclusively that all 
memories are stored permanently, however. 

26. The all-or-none position is illustrated by Guthrie’s theory and Estes’ stimulus-sampling
model. Guthrie’s theory can be summed up by saying, “A
combination of stimuli that has accompanied a movement will on its recurrence
tend to be followed by that movement.” Acts, or the total behaviors
that we observe and study, are made up of many movements. The gradual 
improvement in an act is the result of an increase in the number of these small 
movements, which are each learned in an all-or-none fashion. 

27. Estes’ stimulus-sampling model represents an attempt to translate Guthrie’s 
basic notions into a more formal, mathematical form. According to Estes, 
response probability equals the proportion of stimuli to which it is connected.
“Learning” refers to the gradual growth in the proportion of stimuli
connected to the response over successive trials. 

28. Several investigators have unsuccessfully attempted to find conclusive experimental
support for the all-or-none position.

29. Multiple-trace models argue that, each time an item is presented, the item is 
represented in memory as a unique, discrete event. Strength theory maintains 
that, each time an item is presented, the strength of a single memory representation
is increased.

30. Multiple-trace theory may be closer to the all-or-none position, whereas 
strength theory may be closer to the incremental position. 

31. Much current research centers not so much on whether practice helps learning
but on how learning varies as the conditions of practice vary.

32. Distributed, or spaced, practice has been shown to produce better learning
than massed practice. 

33. Melton’s lag effect refers to the finding that the probability of recall of a 
repeated item in free recall is positively related to the number of intervening
items. 

34. The superiority of distributed practice and the lag effect can be interpreted in
terms of a contextual effect or in terms of encoding variability.

35. The total time law, now discredited, states that the amount of learning that
will occur in a given time interval is constant, regardless of how that time is 
spent rehearsing. 

36. Recent studies have been concerned with the quality as well as the quantity of 
rehearsal. 

37. Level of processing refers to the idea that, the deeper we process information, 
the better we will remember it. Shallow processing occurs when we attend to 
such things as the way a word looks. Deeper processing occurs when we 
attend to such things as the sound and, at the deepest level, the meaning of 
the word. 

38. Criticism of the levels-of-processing approach has been extensive. For example,
some information that is processed on a shallow level can be learned well
and remembered over long periods. 

39. The elaboration and distinctiveness hypotheses may account for depth-of- 
processing effects. 

40. Type I rehearsal is shallow practice designed to maintain information without
transferring it to permanent memory. Type II rehearsal is deeper and
renders information recallable in the future. 

41. Processing can occur in either an automatic or controlled fashion. 

42. Rehearsal is more likely to involve some type of elaboration than it is to 
involve simple rote rehearsal of the material. This holds for lists or groups of 
material as well as for single items.


DEFINITIONS AND PARAMETERS OF REINFORCEMENT

On an operational level there is very little disagreement about the nature of
reinforcers. Reinforcers are defined in terms of their effects. Any stimulus is a 
reinforcer if it increases the strength of a response. You will recall that we distinguished
between positive and negative reinforcers in Chapter 2. A positive reinforcer
is anything that, when presented to the animal, will increase the probability
of the response’s occurring (for example, food). A negative reinforcer is anything 
that, when taken away from the animal, increases the probability of the response 
(for example, shock). Reinforcers, both positive and negative, increase response 
probability. 

That seems to be the limit of common agreement; at this point opinions 
concerning the nature of reinforcement begin to diverge. As soon as we ask why 
reinforcers reinforce, we lose any semblance of unanimity. We have already seen, 
for example, that Hull (1943) thought of reinforcement in terms of the rapid 
diminution of drive level. But Thorndike (1913) earlier defined reinforcement and 
punishment in terms of “satisfying” and “annoying” states of affairs. According 
to Thorndike, S-R connections are strengthened if the occurrence of the response
is followed by a satisfying state of affairs and weakened if it is followed by an 
annoying state of affairs. 

With these brief examples we can begin to see the extent of the controversy 
surrounding the theoretical nature of reinforcement. We shall return to the 
question of theory later in this chapter. For now, let us direct our attention to 
some empirical questions that do not require the adoption of one or the other of 
these various theoretical positions. We shall first look at an outline of the more 
important parameters of reinforcement. 

Imagine a hungry rat in a Skinner box. Imagine a student in a college course. 
Each organism can be induced to acquire more or less complicated responses 
through the application of certain reinforcers. The rat will work for food, whereas
grades, praise, or satisfaction may be sufficient to reinforce the student. We can
vary many attributes, or characteristics, of reinforcing situations. One of the most 
obvious factors is the amount of reward presented immediately following the 
occurrence of a response. The rat might receive one, two, three, or more pellets 
following a correct response. The student might be heartily praised for a correct 
response, or she might receive nothing more than a curt nod from the instructor. 
It is not unreasonable to suspect that these variations would be important in 
learning. A second major factor has to do with the time between the occurrence of 
the response and the receipt of the reward, or the delay of reinforcement. The rat 
might receive its reinforcement immediately after making the response, or it might 
be forced to wait 10 seconds, 20 seconds, or even longer before receiving the 
reward. Similarly, the student might be praised immediately, or the teacher might 
wait until after class to deliver the praise. Finally, we can reinforce every response 
our subject makes, but we need not. We can partially reinforce the subject, or 
reinforce something less than every response. For example, we might want to set
up a schedule of reinforcement, in which we only reinforce every second, third,
fourth, or nth response, leaving the remainder of the responses unreinforced.
Intuitively, one would expect learning to be strongly affected by manipulations
involving the schedule of reinforcement.

These three factors (amount, delay, and schedule) have emerged as critical
parameters of reinforcement. We shall consider each of them individually. 

SCHEDULES OF REINFORCEMENT 

Continuous reinforcement (CRF) refers to situations in which every response is
reinforced. As we have noted, we need not reinforce every response our laboratory
subject makes. We are free to present reinforcements in any pattern we wish
and can reinforce any portion of the occurring responses. In the real world
continuous reinforcement is probably the exception rather than the rule. When
kittens are learning to pounce on moving objects, they are far from completely
successful in their early attempts. As youngsters learn to play basketball, they
miss more baskets than they make. And yet these behaviors are learned. Continuous
reinforcement is not necessary for learning to occur (even though it is
often used in the laboratory). The study of schedules of reinforcement is concerned
with the impact of these patterns of partial reinforcement on the acquisition
of behaviors and represents the formalized investigation of everyday patterns
of reinforcement and their effects on behavior.

The Major Types of Schedule 

FIXED-INTERVAL SCHEDULE. In a fixed-interval (FI) schedule the animal is reinforced
for the first response that occurs after the end of a given time interval. For
example, in an FI 3 schedule the animal is reinforced for the first response that it 
makes after 3 minutes have elapsed since its last reinforced response. It must then 
wait another 3 minutes before it can obtain another reinforcement. It is free to 
respond during the 3-minute intervals, but it will not be reinforced for these 
responses. The very first response following the end of the interval is the only one 
that is reinforced. 
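
The FI contingency is simple enough to sketch in a few lines of Python. What follows is only an illustration of the rule just stated, not a description of any actual laboratory apparatus. The class name FixedInterval, the method respond, the choice of minutes as the unit, the decision to start the first interval at time zero, and the sample response times are all assumptions introduced here.

```python
class FixedInterval:
    """Hypothetical sketch of an FI schedule: reinforce the first response
    made once `interval` minutes have passed since the last reinforcement."""

    def __init__(self, interval):
        self.interval = interval
        self.last_reinforced = 0.0  # assumption: the first interval starts at time 0

    def respond(self, now):
        """Return True if a response made at time `now` (minutes) is reinforced."""
        if now - self.last_reinforced >= self.interval:
            self.last_reinforced = now  # the next interval is timed from here
            return True
        return False  # responses inside the interval go unreinforced


fi3 = FixedInterval(3)  # an "FI 3" schedule
response_times = [0.5, 1.0, 2.9, 3.2, 4.0, 6.0, 6.5, 7.0]  # made-up session
outcomes = [fi3.respond(t) for t in response_times]
print(outcomes)
```

In this made-up session only the responses at 3.2 and 6.5 minutes are reinforced. Every other response falls inside an interval and earns nothing, which is why pausing after each reinforcement costs the animal so little.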

An FI schedule of reinforcement leads to a very specific kind of responding. The 
animal develops a response strategy to fit the reinforcement schedule. A reasonable
strategy for an FI schedule would be to pause after the receipt of the
reinforcement, as there is no need to respond continuously, and then, as the end of
the interval approaches, respond more and more frequently. This is exactly the
kind of responding we observe in laboratory animals. Long pauses follow reinforcements.
The rate of responding accelerates as the end of the interval
approaches. As can be seen in Figure 6.1, a characteristic “scalloping” effect is 
observed in the cumulative records of FI schedules (see Innis, 1979; Madigan, 
1978; Zeiler & Buchman, 1979). 

In a very rough way the studying behavior of students sometimes follows this 
pattern. Suppose that a given class has a midterm exam and a final exam. These 
exams are reinforcing events (positive or negative, depending on how you look at 
them). Many students do very little studying as the semester begins. As the 
midterm approaches, studying behavior picks up until it reaches a high rate just 
before the exam. After the exam it drops to zero and remains there, only to 
accelerate again as the final exam approaches. Obviously, not all students study in
this manner, and obviously, not all materials are handled in this fashion (see
Figure 6.2). But there is an interesting, and quite striking, correspondence between
the effects of FI schedules and the effects of our traditional examination
patterns.

Figure 6.1 Stylized cumulative-response records obtained under four common schedules
of reinforcement. Slash marks in the records indicate presentations of reinforcements.
(Adapted from Operant Learning: Procedures for Changing Behavior by J. L. Williams.
Copyright © 1973 by Wadsworth, Inc. Reprinted by permission of the publisher,
Brooks/Cole Publishing Company, Monterey, California.)

It might be argued that “mail-checking behavior” also resembles behavior 
under the control of an FI schedule. If we know that the mail arrives promptly at 
noon, we do not begin to look for it at 8 A.M. As noon approaches, we begin to 
look, listen, and check the mailbox. After the delivery arrives, there is very little 
chance that we will check again until the next day. 

As a final example, Weisberg and Waldrop (1972) have presented an amusing
analysis of the bill-passing behavior of the U.S. Congress. As can be seen in Figure 
6.3, Congress passes very few bills during the first few months of a given session. 
As the time for adjournment approaches, the rate of bill passage accelerates—in a 
pattern just like that of a rat in a Skinner box, scalloping and all. Actually, the 
similarity between the two situations may be more fanciful than real, for we know 
very little about the pressures on Congress that produce this pattern of behavior. 
As Lundin (1974) suggests, the scalloping could result from the increasing demands
of various lobbies, special-interest groups, and powerful constituents.

THE VARIABLE-INTERVAL SCHEDULE. In a variable-interval (VI) schedule the 
animal is again forced to wait before it can obtain a reinforcement, but the time 
interval is variable. After the animal has responded and been reinforced, it might 
have to wait for 5 minutes before being able to receive a reward. Then it might 
have to wait for 3 minutes, and then 30 seconds, and so on. A VI 2 schedule is one 
in which the average time interval between reinforcements is 2 minutes (see
Bushnell & Weiss, 1978).

Figure 6.2 Hypothetical studying patterns. Although examination patterns often resemble
a fixed-interval schedule, and the text argues that scalloping often appears in students’ 
studying behavior, we have all met exceptions to the rule. Some people never stop 
studying, some only turn to the books just before the exam, some keep up a steady rate 
throughout the semester, and some never study at all. In these cases the individual is 
probably responding to some alternative set of reinforcers and some other, more 
powerful, schedule. For example, the person who does not study at all may be rewarded 
for doing so by his peers. The crammer may have had great success with this technique in 
the past. The continuous studier may find relief from anxiety by constantly studying. 

It seems that steady responding is most efficient in this case (see Figure 6.1). 
Because the animal is unable to judge the length of the time intervals, it cannot 
pause as it does in the FI situation. It must constantly “check in” with a steady 
stream of responses. As one would expect, the response rate is high when a short 
average interval is employed and lower when a longer average interval is utilized. 
The studying behavior of students taking a class in which the instructor gives pop 
quizzes or surprise exams will show these characteristics. 
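
The VI rule differs from the FI rule only in that the required wait changes after each reinforcement. The sketch below is a hedged illustration under stated assumptions: the class name VariableInterval, the method respond, and the particular waits of 3.0, 0.5, and 2.5 minutes (chosen so that they average 2 minutes) are all introduced here. For the sake of a reproducible example the waits simply repeat in a fixed cycle; a real schedule would presumably arrange them unpredictably.

```python
from itertools import cycle


class VariableInterval:
    """Hypothetical sketch of a VI schedule: the wait required before the next
    reinforced response changes after every reinforcement."""

    def __init__(self, intervals):
        self.mean_interval = sum(intervals) / len(intervals)
        self._intervals = cycle(intervals)  # assumption: waits repeat in a fixed cycle
        self._wait = next(self._intervals)
        self._last_reinforced = 0.0

    def respond(self, now):
        """Return True if a response made at time `now` (minutes) is reinforced."""
        if now - self._last_reinforced >= self._wait:
            self._last_reinforced = now
            self._wait = next(self._intervals)  # draw the next required wait
            return True
        return False


vi2 = VariableInterval([3.0, 0.5, 2.5])  # waits average 2 minutes: a "VI 2"
# A steady stream of responses, one every quarter minute for 12 minutes.
response_times = [0.25 * k for k in range(1, 49)]
reinforced_times = [t for t in response_times if vi2.respond(t)]
print(vi2.mean_interval, reinforced_times)
```

Because the steady responder in this sketch never lets more than a quarter minute pass without responding, it collects each reinforcement almost as soon as it becomes available, whatever the length of the current wait.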

THE FIXED-RATIO SCHEDULE. In a fixed-ratio (FR) schedule (see Hobson, 1975) 
a certain number of responses must be made before the reward is delivered. In an 
FR 10 schedule a pigeon must peck a key 10 times before being reinforced. This 
schedule yields extremely high rates of responding (see Figure 6.1). The animal 
responds rapidly until it receives its reinforcement. Interestingly, the FR schedule 
often produces a mild postreinforcement pause. It is not nearly so pronounced as 
the scalloping effect obtained with FI schedules, but it is a definite characteristic 
of the FR schedule. The higher the ratio, the longer is the postreinforcement 
pause. 
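
An FR schedule ignores the clock altogether and simply counts responses. The following minimal sketch shows the counting rule for the FR 10 case; the class name FixedRatio, the method respond, and the 25-response session are assumptions made for illustration only.

```python
class FixedRatio:
    """Hypothetical sketch of an FR schedule: every `ratio`-th response
    is reinforced, regardless of how much time has passed."""

    def __init__(self, ratio):
        self.ratio = ratio
        self.count = 0  # responses made since the last reinforcement

    def respond(self):
        """Register one response; return True if it is reinforced."""
        self.count += 1
        if self.count == self.ratio:
            self.count = 0  # start counting toward the next reinforcement
            return True
        return False


fr10 = FixedRatio(10)  # an "FR 10" schedule
outcomes = [fr10.respond() for _ in range(25)]  # a made-up run of 25 pecks
reinforced_at = [i + 1 for i, hit in enumerate(outcomes) if hit]
print(reinforced_at)
```

Since reinforcement depends only on the count, the faster the responses are made, the sooner each reward arrives, which fits the high response rates described above.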

This postreinforcement pause has sometimes been called a procrastination 
effect (Domjan & Burkhard, 1982). For example, if you are writing a term paper 
composed of several sections, you may work rapidly until you have completed 
one section. Then you may heave a sigh of relief, take a break, and procrastinate 
for a while before you begin the next section.

Figure 6.3 Fixed-interval work habits of Congress. (Adapted from Weisberg, P., and
Waldrop, P. B. Fixed-interval work habits of Congress. Journal of Applied Behavior
Analysis, 1972, 5, 93-97. Fig. 1, p. 94.)

Employees working on the assembly line in a factory find themselves required 
to process large numbers of units for a certain amount of pay. They are working 
under an FR schedule. In the case of a “speedup,” unscrupulous employers 
require a certain number of units for a certain amount of pay. Once that level of 
responding is obtained, they run the assembly line faster, forcing the workers to 
produce more and more for the same amount of pay. 

Extremely high fixed ratios can be established by gradually increasing the 
number of responses required for a reward, but this cannot be done all in one 
step. Findley and Brady (1965) describe an experiment in which 120,000 responses
were required before a chimpanzee received food. This is an extreme
example, of course, and most situations both inside and outside the laboratory 
involve much smaller ratios. 

THE VARIABLE-RATIO SCHEDULE. In the variable-ratio (VR) schedule the number 
of responses required of the subject varies. A rat may first be required to press a 
lever 10 times for a reward. Then 3 responses may be required, then 7, then 4, and 
so on. In a VR 20 schedule the average number of required responses is 20. This 
schedule also produces high rates of responding (see Figure 6.1). The principal
difference between this schedule and the FR schedule is the absence of the
postreinforcement pause in the VR situation. Variable-ratio responding is strong 
and steady. 
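
The VR rule can be sketched in the same spirit. The example below uses the sequence of requirements mentioned above (10, 3, 7, and 4 responses), which averages 6 and so would be called VR 6. The class name VariableRatio, the method respond, and the decision to repeat the requirements in a fixed cycle are assumptions introduced here; an actual schedule would presumably vary them unpredictably.

```python
from itertools import cycle


class VariableRatio:
    """Hypothetical sketch of a VR schedule: the number of responses required
    for reinforcement changes after every reinforcement."""

    def __init__(self, requirements):
        self.mean_ratio = sum(requirements) / len(requirements)
        self._requirements = cycle(requirements)  # assumption: fixed repeating cycle
        self._needed = next(self._requirements)
        self._count = 0

    def respond(self):
        """Register one response; return True if it is reinforced."""
        self._count += 1
        if self._count >= self._needed:
            self._count = 0
            self._needed = next(self._requirements)  # next requirement
            return True
        return False


vr6 = VariableRatio([10, 3, 7, 4])  # requirements average 6: a "VR 6"
outcomes = [vr6.respond() for _ in range(30)]  # a made-up run of 30 lever presses
reinforced_at = [i + 1 for i, hit in enumerate(outcomes) if hit]
print(vr6.mean_ratio, reinforced_at)
```

From the subject's side the next reward might always be only a response or two away, which is one way of picturing why no postreinforcement pause appears.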

The most striking example of VR scheduling outside the laboratory comes from 
the world of gambling. When the vacationing schoolteacher stands in front of a 
slot machine in Las Vegas punching in nickels for hours on end, he is in the grip of 
a VR schedule. The machine is programmed to pay off after irregular numbers of 
responses. Some machines pay off quite frequently, especially in casinos where the 
customer’s quick return is desired. Jackpots obtained all over the casino are 
announced over a loudspeaker in order to create the impression of a more 
favorable ratio. Slot machines located in spots where the customer is unlikely to 
return (for example, airports, bus terminals) are probably programmed to pay off 
only rarely. 

Every year, the Girl Scouts come around with their boxes of cookies. Every 
year, eager little Cub Scouts show up at the door with tickets to the annual 
Scout-o-Rama. These children may be under the spell of a VR schedule. They 
must ring door bells to sell, but they do not know how many they will have to ring 
before they make a sale. Nor will sitting at home waiting for the end of an interval 
make a sale. They have to make variable numbers of responses to sell. What 
makes it worse is that they know that every other member of their organization is 
out there doing the same thing, raising the ratio minute by minute, in a highly 
competitive situation. 

CONTINUOUS VERSUS PARTIAL REINFORCEMENT. It seems strange in a way, but 
partial-reinforcement schedules sometimes produce higher rates of responding 
than does continuous reinforcement. For example, Stephens, Pear, Wray, and 
Jackson (1975) rewarded mentally retarded children for learning the names of 
pictures. They told the children “good” and gave them candies as reinforcers. Part 
of the time the children were reinforced for every correct response they made 
(CRF), and part of the time they were placed on an FR schedule. As is shown in 
Figure 6.4, four out of five children definitely responded more often under the FR 
schedule, with the fifth child responding so seldom as to make a schedule 
comparison difficult. 

Although this study was done with mentally retarded children, partial schedules
often seem to produce greater response rates in other humans and animals as
well. (It should be noted that it may be best to begin learning under continuous
reinforcement and then shift to partial reinforcement to attain maximum response
rate. Continuous reinforcement is better to “get the organism started,”
but once responding has begun, shifting to a partial schedule will yield the
highest rate.)

MULTIPLE SCHEDULES. Multiple schedules consist of two or more simple schedules
presented one after the other (see Norborg, Osborne, & Fantino, 1983). In
addition, distinctive cues, such as colored lights, are associated with or presented
during each of the simple phases. For example, suppose that a pigeon is pecking a
key for food under a mult FI 3 VR 10 schedule. What this means is that it is first
trained under an FI schedule. Some cue, such as a red light, is present while the
bird is undergoing training. When the schedule is shifted to VR 10, the red light is
replaced by, say, a green light. Each time the schedule is changed, the cue is
changed.

Figure 6.4 Partial reinforcement yields higher correct response rates than continuous
reinforcement. Retarded children were reinforced with candy and a verbal “good” either
continuously (CRF) or partially (FR 5 and FR 12) for naming pictures. The FR schedules
produced higher correct response rates. (Adapted from Stephens, C. E., Pear, J. J., Wray,
L. D., and Jackson, G. C. Some effects of reinforcement schedules in teaching picture
names to retarded children. Journal of Applied Behavior Analysis, 1975, 5, 435-447.
Fig. 1, p. 439.)

As might be expected, the animal shifts its pattern of responding as the schedule 
shifts. But it is not a simple situation. The animal’s responding under any given 
schedule is often affected by the nature of the preceding schedule. Multiple 
schedules involve complicated interaction effects. The nature of these complex 
interactions has recently come under rather intense investigation (Green & Rachlin,
1975; Keller, 1974; Nevin, 1974a, 1974b; Richards, 1974).

The mixed schedule is a variation of the multiple schedule in which no external 
cues (such as colors) are associated with the schedules. The shift from one 
schedule to another can be detected only from the changed patterns of reinforcement.
As expected, this often reduces the ease or efficiency with which the animal
shifts from one form of responding to the next. 

CHAINED SCHEDULES. In chained schedules (Miller, 1975) two or more successive
schedules must be completed before a reinforcement can be obtained. For
example, in an FI 5 FR 10 chained schedule the animal must first respond after 5 
seconds have elapsed and must then give 10 responses, regardless of their timing, 
before it will be reinforced. 

In chained schedules external stimuli (such as colors) are associated with each 
successive phase in the chain. Tandem schedules (Weisman & Davis, 1975) are
the same as chained schedules, except that there are no such changes in external
stimuli when the successive phases are completed. Thus, in tandem schedules the
animal is not provided with information concerning the completion of successive 
phases. 
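
The chained FI 5 FR 10 example can be pictured as a small two-state machine: an FI link that must be completed first, followed by an FR link that ends in reinforcement. The sketch below is one possible reading of that description, offered under stated assumptions. The class name ChainedFIFR, the attribute phase, the use of seconds as the unit, and the one-response-per-second session are all introduced here for illustration.

```python
class ChainedFIFR:
    """Hypothetical sketch of a chained FI-FR schedule: the FI link must be
    completed before responses begin to count toward the FR link."""

    def __init__(self, interval, ratio):
        self.interval = interval  # seconds required by the FI link
        self.ratio = ratio        # responses required by the FR link
        self._start_chain(0.0)

    def _start_chain(self, start):
        self.phase = "FI"  # the current link; in a chained schedule a cue signals it
        self.phase_start = start
        self.count = 0

    def respond(self, now):
        """Return True if a response made at time `now` (seconds) is reinforced."""
        if self.phase == "FI":
            if now - self.phase_start >= self.interval:
                self.phase = "FR"  # first link completed, but no reward yet
            return False
        self.count += 1
        if self.count == self.ratio:
            self._start_chain(now)  # the chain begins again after the reward
            return True
        return False


chain = ChainedFIFR(interval=5, ratio=10)
# A made-up session: one response per second for 20 seconds.
reinforced_times = [t for t in range(1, 21) if chain.respond(t)]
print(reinforced_times)
```

In a chained schedule the value of phase would be signaled to the animal by an external stimulus such as a colored light. A tandem schedule would run the same logic while keeping that information hidden.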


Concurrent Schedules and the Matching Law 
Concurrent schedules are those in which the animal is faced with two or more 
levers or keys. Each key operates according to its own schedule. The animal is free 
to press any key (Rider, 1979; White, 1979). Thus, in a concurrent FI 2 FI 8 
schedule, responses to one key are reinforced every 2 seconds, and responses to a 
second key are reinforced every 8 seconds. 

The matching law (Herrnstein, 1970) refers to the fact that the relative frequency
of responding on a key matches the relative frequency of reinforcement for
responses on that alternative. In other words, the animal responds most often to 
the key, or lever, that delivers a reinforcement most often, and the responses are 
made in direct proportion to the frequency of that reinforcement. 
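
The proportional relationship just described can be made concrete with a little arithmetic. For two keys it is commonly written B1/(B1 + B2) = R1/(R1 + R2), where B stands for the number of responses and R for the number of reinforcements on each key. The Python sketch below is an illustration only: the function name and the reinforcement rates are hypothetical, and the symbols B and R are notation introduced for this example.

```python
def predicted_response_shares(reinforcement_rates):
    """Share of responding the matching law predicts for each alternative.

    `reinforcement_rates` holds the reinforcements obtained per unit of time
    on each key (hypothetical values, for illustration only).
    """
    total = sum(reinforcement_rates)
    if total <= 0:
        raise ValueError("at least one alternative must deliver reinforcement")
    return [rate / total for rate in reinforcement_rates]


# Two keys delivering 40 and 20 reinforcements per hour (made-up numbers):
# strict matching predicts two thirds of the responses on the first key.
shares = predicted_response_shares([40, 20])
print(shares)
```

The same function accepts three or more alternatives, as in a situation with three conversational partners who reinforce at different rates.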

Matching behavior has been demonstrated many times with animals (see Real
& Hobson, 1983). But we humans display matching behavior, too. For example,
Conger and Killeen (1974) told subjects that they would be talking with three 
other people. Actually, these three people were confederates of the experimenter 
who had been told to reinforce the subject according to different schedules. 
Reinforcement consisted of making positive remarks and so on. The matching 
law was supported by the fact that the relative amounts of time the subjects spent 
freely choosing to talk to each of the three confederates matched the relative rates 
of reinforcement delivered by the confederates. 

Why do animals, and humans, display matching behavior in choice situations? 
What is behind this intriguing piece of behavior? At present the answer is not 
clear, and the issue is being hotly debated (see Green, Rachlin, & Hanson, 1983; 
Hinson & Staddon, 1983). The most likely candidate is expressed as the maximizing
hypothesis. According to this hypothesis, we display matching behavior
in order to maximize the number of reinforcements we get for the number of 
responses we make. We switch back and forth in order to get the maximum 
number of reinforcements. It’s an intuitively appealing idea and seems almost 
self-evident. Its simplicity is apparently deceptive, however; it is not known 
whether matching is determined by an effort to maximize reinforcements or by 
some other as yet undetermined factors. 

Schedule-Induced Behavior 

Partial-reinforcement schedules not only increase the strength of the responses 
that are being reinforced but also stimulate so-called schedule-induced behavior 
(Falk, 1971; Muller, Crow, & Cheney, 1979; Yoburn & Cohen, 1979). What this 
means is that, during those intervals when no reinforcement is possible, both 
animals and humans will begin to develop certain behaviors such as pacing back 
and forth, clenching the jaw, and (in the case of some animals, at least) attacking 
pictures of members of one’s own species. Other schedule-induced behaviors 
include biting, eating, drinking, running, and licking. None of this behavior is 
reinforced by the schedule in effect, and yet it does develop, and quite strongly. 
Muller, Crow, and Cheney (1979) find that these behaviors are “not readily 
explained by current principles of learning.”

Figure 6.5 Hypothetical data relating time between reinforcements to schedule-induced
behavior.

But the situation may not be that mysterious. It may be that these schedule-induced
behaviors are merely being reinforced by factors beyond the control of
the experimenter. For example, if you were on an FI 10 schedule when the reward
was 50¢, you might well develop some pacing or jaw-clenching behaviors during
the interval to (1) ward off boredom, (2) reduce tension and frustration, and (3) 
receive some kind of stimulation. In other words, these behaviors may be nothing 
more than responses reinforced by subtle but strong reinforcers. 

One thing can be said with certainty about schedule-induced behavior: It varies 
as a function of the length of time between reinforcements. As you can see in 
Figure 6.5, these “extra” behaviors first increase and then decrease as the time 
between reinforcements increases. Exactly why this relationship exists is not 
known. 


Autoshaping 

There is an interesting variation on the theme of schedules of reinforcement that 
may turn out to be extremely important. Autoshaping (Brown & Jenkins, 1968; 
Gamzu & Schwam, 1974; Jenkins & Moore, 1973; Pisacreta, Redwood, & Witt,
1983; Wasserman & Molina, 1975) refers to what happens when reinforcement is 
presented, regardless of whether the animal responds. (For related work see 
Skinner’s [1948] early, classic analysis of so-called superstitious behavior.) In a 
typical autoshaping experiment the illumination of a key is always followed by 
delivery of food. Thus, a hungry pigeon need not respond at all in this situation. It 
need only wait until the key lights up and then eat the food, which always follows 
the light. And yet it has been observed that the pigeon does peck the key. The
key-pecking response develops even though it is not necessary for the receipt of a
reward. 

This effect, although quite controversial (see Wessells, 1974), calls into question
the very nature of what is normally called instrumental conditioning. What it
suggests is that there is a strong element of classical conditioning in the typical 
instrumental-conditioning situation. The UCS is the food. The food (UCS) automatically
elicits a pecking response (the UCR). The UCS (food) is paired with a
CS (the light). After a series of such pairings, the CS (light) comes to elicit a CR 
(pecking in response to the light), which resembles the UCR (pecking in response 
to the food). 

This all goes back to the problem that was outlined in an earlier chapter. The 
distinction between classical and instrumental conditioning is difficult to maintain.
The autoshaping effect suggests that classical conditioning is so intertwined
with even the simplest instrumental-conditioning situation that it is impossible to 
conclude that we ever have a pure case of classical or instrumental conditioning 
(see also Brandon & Bitterman, 1979; Durlach, 1983; Gibbons, Baldock, Locurto,
Gold, & Terrace, 1977; Rescorla, 1979).

Extinction: The “Partial-Reinforcement Effect” 

Thus far we have been speaking of the relationship between schedules of reinforcement
and the acquisition process. We now shall look at how extinction is
related to partial reinforcement. One of the most widely noted phenomena in the
field of learning is the fact that resistance to extinction following partial reinforcement
is greater than resistance to extinction following continuous reinforcement
(Lewis, 1960; Platt & Day, 1979; Seybert, Baer, Harvey, Ludwig, & Gerard,
1979). All four of the simple schedules (VI, FI, VR, FR) produce greater resistance 
to extinction than does continuous reinforcement (CRF). Although we will not 
deal with these points in detail, it should be noted that each of the four simple 
schedules produces its own characteristic pattern of responding during extinction 
(see Williams, 1973). What we are primarily concerned with here is the fact that 
all partial schedules yield greater resistance to extinction than does CRF. This is
the partial-reinforcement effect (PRE).

If you reinforce a rat only part of the time and then remove reinforcement
altogether, the rat will go on responding during this extinction phase longer than it
would if it had been reinforced 100% of the time. Humans display the same effect 
in everyday life. For example, suppose a child asks his mother for a piece of candy 
and she says no. Then he asks again and again. Finally, in exasperation, she gives 
in and reinforces him with candy. What may happen if this pattern persists is that 
the child will become something of a demanding brat. He never seems to give up. 
He has been partially reinforced and is thus less likely to give up his begging, 
nagging behavior than is a child who is refused candy after having been given 
candy each and every time he asked for it. 

THE DISCRIMINATION HYPOTHESIS. There are three interpretations of the PRE. 
According to the first of these, the discrimination hypothesis (Mowrer & Jones, 
1945), subjects on a partial reinforcement schedule may be less likely to notice 
that anything has changed at all when they are put on an extinction schedule. 
They are used to a lot of trials without reinforcement and may, at least temporarily,
fail to discriminate between partial reinforcement and extinction. Continuously
reinforced subjects, by contrast, may much more easily notice the
differences between CRF and extinction and may thus stop responding sooner. 

Although intuitively appealing, the discrimination hypothesis has not been well 
supported experimentally (Gibbs, Latham, & Gormezano, 1978; Theios, 1962). 
The next two interpretations have met with more success. 

FRUSTRATION THEORY. Amsel (1958) has proposed another interpretation of 
the PRE. According to this position, partially reinforced animals experience 
frustration when they make a response and are not reinforced. This emotion 
becomes classically conditioned to the learning situation (for example, a runway 
and goal box). But because they are reinforced part of the time, they do learn the 
response and, more importantly, learn to make the response in the presence of this 
conditioned emotional frustration. They are “used to” feeling frustrated and so 
continue to respond during extinction. Now consider the continuously reinforced 
animal that is suddenly put on extinction. It has never been frustrated during 
acquisition; therefore, it has not learned to respond in the presence of frustration. 
The frustration it feels during extinction is disruptive and facilitates extinction. 

SEQUENTIAL THEORY. The final interpretation of the PRE is called the sequential 
theory. Capaldi (1971) proposed that animals can remember whether they were 
reinforced on the preceding trial and that it is this memory of not having been 
rewarded recently that motivates them to respond on the present trial. If you were 
in the rat’s place you might say to yourself, “I know my responses are rewarded 
some of the time, and I haven’t been rewarded recently, so it’s probably time for a 
reward. I guess I’ll respond now.” 

Both the sequential theory and the frustration theory have stimulated a good 
deal of supportive research. Both mechanisms may contribute to the PRE. At 
present neither theory seems to be completely correct. 

Schedules: Humans and Animals 

Most of the schedules and data we have discussed have been developed using 
“lower” animals. It may legitimately be asked if the same effects will always be 
obtained with humans. The answer varies, depending on the psychologist whom 
you consult and the task involved. The position taken in this text is that there is 
quite good, though not perfect, correspondence between the patterns obtained 
with animals and humans. For example, Yukl, Wexley, and Seymore (1972) 
reinforced the scoring of IBM answer cards with either continuous reinforcement 
or a VR 2 schedule. The pattern of their results tended to agree with the data 
obtained in animal experiments. Using mentally deficient children, candy as 
reinforcement, and a variety of schedules, Orlando and Bijou (1960) found the 
behaviors generated by their selected schedules were quite similar to those 
obtained with experimental animals. Verplanck (1956) presents data suggesting 
that normal college students act very much like experimental animals when 
subjected to various scheduling manipulations (see also Davidson & Grayson
Osborne, 1974; Karen, 1974). Thus, there is substantial evidence for the notion
that we are learning something about humans when we manipulate schedules of
reinforcement with animals. Of course, there are exceptions to this tidy correspondence
between animal and human behavior. In investigating human choice
behavior, for example, Schmitt (1974) has been led to suggest that humans and 
animals differ quite markedly in terms of their responses to variations in rate of 
reinforcement. 

Does the partial-reinforcement effect appear with humans as well as with 
experimental animals? The answer is that the available results are inconclusive. 
On the positive side, we have reports that resistance to extinction is greater 
following partial reinforcement when the subjects are children and the responses 
are such things as pressing down the nose of a clown and placing rubber balls in 
holes (Bijou, 1957a, 1957b). Furthermore, Brackbill (1958) found learned smiling 
responses in infants to be more resistant to extinction following partial reinforcement
(see also Nation & Massad, 1978). But others (see Keppel, Zavortink, &
Shiff, 1967) have failed to find PRE in human situations. In summary, it can be 
argued that the similarities between human and animal responses to schedule 
manipulations outweigh the differences. In view of the complex unpredictability 
on the part of humans it is not surprising that our animal and human data do not 
always look alike. Failures to find similarities between animal and human behavior
may not result from any basic differences in the underlying mechanisms of
learning. They may stem from our ignorance of the multiple interests and purposes
of the complex human.

DELAY OF REINFORCEMENT 

The investigation of delay of reinforcement began decades ago (see Skinner, 1938; 
Wolfe, 1934) and continues today (see Richards, 1974; Richards & Marcattilo, 
1978; Thomas, Lieberman, McIntosh, & Ronaldson, 1983). If the time between a 
response and the subsequent delivery of the reinforcement is increased, learning 
tends to be inhibited. If a rat is forced to wait for its reward after each lever press, 
for example, it will learn the response much less quickly than if it were reinforced 
immediately after each response. If children are rewarded immediately after 
picking up their toys, they will be more likely to learn than if they were forced to 
wait until the end of the week for their reward. If the reward is delayed too long, 
learning may not occur at all. 

The relationship between delay of reinforcement and learning tends to assume 
a characteristic form. Figure 6.6 contains several stylized delay-of-reinforcement 
gradients. Two things should be noted about these gradients. First, performance 
drops off rapidly as the delay increases. Second, even though it does drop off, 
some performance gains do occur even under conditions of delayed reinforcement.
Most of the experimental work has involved delays from several seconds to
several minutes. Under some conditions delays of a few seconds can eliminate
learning, whereas under other conditions long delays have little effect on performance.
Let us now consider why these differential effects appear.

Figure 6.6 Some hypothetical delay-of-reinforcement gradients.

The Role of Secondary Reinforcers

Over the years a little story has developed concerning delay-of-reinforcement
effects. In a very early experiment Watson (1917) trained rats to dig down through
sawdust to reach a cup containing food. Half of the animals were permitted to
consume the reward immediately. The other half were required to wait for 30 
seconds before eating the food. Surprisingly, Watson found no differences between
the two groups in terms of their digging performance. Delaying the reinforcement
for 30 seconds did not hinder learning in the slightest. One obvious
reason for this result has to do with the fact that the delayed animals waited in the 
vicinity of the food cup. They could smell the food and see the cup, which they 
associated with food. In other words, the presentation of the cues that the animal 
associated with food (smell, sight) probably reinforced the digging response 
immediately. 

These cues (smell, sight of cup) that the animal associated with the primary 
reinforcer (food) are called secondary reinforcers. A secondary reinforcer is a 
neutral stimulus that, through constant association with primary reinforcers, 
acquires reinforcing properties itself. For example, money is a secondary reinforcer.
Green rectangles of paper are, in and of themselves, of little value to us. And
yet many of us will work madly to obtain them. The green paper has acquired 
reinforcing properties because it has been associated or paired with more primary 
reinforcers (food, warmth, drink, sleep, sex, comfort, and the like). 

We will consider secondary reinforcers in detail in a later section. For now it is 
sufficient to understand that secondary reinforcers have been heavily implicated 
in delay-of-reinforcement effects. Over the years a hypothesis concerning delays
and secondary reinforcers has emerged. It is argued that immediate reinforcement
is necessary for learning to occur. If an experiment demonstrates a performance
increase when primary reinforcers are delayed, then that performance increase
must result from the immediate, and not delayed, delivery of secondary reinforcers.

This is an interesting hypothesis, and to test it we could try to eliminate 
potential secondary reinforcers from our experiments with delaying reinforcement.
Performance under conditions of delay should then be reduced. With
successive eliminations of additional secondary reinforcers we should note a 
continuing drop in our gradient, such as the shift from gradient 1 to 2 to 3 in 
Figure 6.6. 

Just such an experimental strategy has developed in the literature. A well-known
group of studies (for example, Grice, 1948; Perin, 1943; Perkins, 1947)
was concerned with the removal of contaminating secondary reinforcers in
delay-of-reinforcement experiments. An example of their strategy will suffice. Wolfe
(1934) trained rats to choose one arm (left or right) in a T maze. Once the rats 
were released from the starting box, they were allowed to proceed to the choice 
point and to turn and move in the direction of the goal box. But before reaching 
the goal box the animals were held in a delay chamber for varying amounts of 
time. Then they were released and allowed to proceed to the end of the arm. 
Wolfe found that rats could be delayed as long as 20 minutes and still learn the 
response. Perkins (1947) called attention to a very important aspect of this 
situation. When the rat turns in the correct direction, it is always held in a very 
distinctive delay box. The box might not seem distinctive to us, but to a confined 
rat the environment must surely be so. It might contain a paint smudge here, a 
unique crack there, or perhaps slight variations in texture. It is this distinctive 
delay box that is always followed by reward. Hence, the delay box must become a 
secondary reinforcer. Every time the animal chooses the correct arm and is held in 
the distinctive delay box, it is reinforced immediately by the presence of the box. 
If it chooses the incorrect arm, it is confined in another distinctive delay box 
(different paint smudges, smell, and so on). This chamber is never associated with 
or followed by a reward. Hence, it does not become a secondary reinforcer. 

To remove the distinctive delay box as a source of secondary reinforcement 
Perkins (1947) merely switched delay boxes on successive trials. Thus, neither box 
was consistently associated with either reward or absence of reward. Under these 
conditions the gradient fell. Less learning was observed under conditions of delay 
when the secondary reinforcer was removed. 

Additional sources of secondary reinforcement (for example, proprioceptive 
cues, or “feelings,” associated with making a left or a right turn) were eliminated
in subsequent experiments. As each successive source of secondary reinforcement 
was removed, the gradient fell an additional step. Hence, for many (see Spence, 
1947) it has been assumed that learning might not occur at all in the absence of 
some kind of immediate reinforcement. According to this position, either primary 
or secondary reinforcement must occur immediately after the occurrence of the 
response. But the issue has yet to be resolved to everyone’s satisfaction. More 
recent experimentation has suggested that some learning may, after all, occur in 
the absence of any immediate reinforcement (Lett, 1973; Lieberman, McIntosh,
& Thomas, 1979; Revusky, 1971; Revusky & Garcia, 1970; Rozin & Kalat,
1971). These results have been obtained in both aversive-conditioning and T-maze
situations.

Thus, it is really too early to tell where this story will end. The most reasonable 
position to take is that secondary reinforcers are often involved in delay-of-reinforcement
experiments and that their removal does reduce learning under
conditions of delayed primary reinforcement. Nevertheless, there may well be
situations in which learning can occur when reinforcement is truly delayed.

Humans and Delay of Reinforcement 
Given that the situation is confusing with respect to animal studies, what can we 
possibly hope to say about human learning and delay of reinforcement? One 
thing that can be said with some certainty is that, if secondary reinforcers are 
involved in animal experiments, they are enormously involved in human delay 
situations. Humans are capable of performing very well under conditions of 
extreme delay. Presumably many of the readers of this text are plowing on 
without the hope of anything remotely resembling a primary reward in the 
foreseeable future. And yet they keep after it, slogging away, presumably being 
maintained by the presence of all sorts of secondary reinforcers (such as congratulating
themselves on having beaten down one more chapter). Any sort of long-range
human learning endeavor (for example, pursuit of career goals) is probably
maintained through the involvement of secondary reinforcers. 

Experimental work supports the notion that reinforcement delays (for example,
delaying feedback on performance) are not particularly effective in reducing
human learning (Bilodeau, 1966; Brackbill, Adams, & Reaney, 1967). Most 
studies show that humans are capable of learning in spite of long delays. 

What it may come down to is that humans are generally better able to bridge 
time gaps than animals. For example, language allows us to reinforce ourselves on 
the completion of any given response even though the primary reward may be 
delayed. We can think ahead, see the relationship between our current action and 
our ultimate goal, and thereby reinforce ourselves “ahead of time.” 

AMOUNT OF REINFORCEMENT 

What is the relationship between amount of reinforcement and performance 
level? Will a rat’s running speed vary as a function of the amount of food it is 
given at the end of the runway? Will children do better in school if they receive 
more praise from the teacher? These are the kinds of question that prompt a 
consideration of the relationship between amount of reward and performance. 

The answers we have been able to obtain in the laboratory are not as clear cut 
as we would like them to be. Studying the effects of variations in the amount of 
reward has been difficult over the years. In a very general, oversimplified way we 
can suggest that performance increases as the amount of reward increases (see 
Gallistel, 1978; Osborne, 1978). The greater the amount of reward, the better is 
the performance. Beyond a certain point further increases in the magnitude of the 
goal may fail to result in further increases in performance. 

Components of the Situation 

Two early and widely quoted studies relating performance to amount of reward
were done by Crespi (1942) and Zeaman (1949). Zeaman had rats run down a
3-foot runway to a goal box. Different groups of rats received different amounts 
of cheese (.05, .2, .4, .6, .8, 1.6, or 2.4 grams) once they reached the goal box. 
Zeaman found that the latency of the running response (time required to leave the 
start box) decreased as the magnitude of the reward increased. These and similar 
studies led to the general notion that performance increases as the amount of 
reward increases (see Wielkiewicz, 1979). But as psychologists continued their 
examination of the amount-of-reward variable, they began to realize that it is 
quite a complicated problem and that there are at least three distinct components 
to the amount-of-reward situation. 

First, one must consider the mass or volume of the reinforcer (for example, 
grams of food ingested or volume of water consumed). Second, the amount of 
consuming activity the animal engages in may be important (number of licks, 
pecks, or swallows). Third, one must consider the quality as well as the quantity 
of the reward (for example, sweet versus neutral flavor). Two or more of these 
factors are often confounded in a given experimental situation. If we merely vary 
the number of pieces of corn that we give to pigeons, for example, we have 
confounded volume of food with amount of consuming activity (one peck per 
kernel) (see Figure 6.7). 

Much of the work in this area has been designed to tease out the independent 
effects of these various components. For example, one way to separate the 
consuming component from the ingested-volume component is to surgically sever 
the animal’s esophagus. In this way liquids can be introduced directly into the 
animal’s stomach, thereby eliminating the potentially confounding effects of 
consuming activities. Similarly, liquids can be consumed only to emerge from the 
opening in the esophagus, thereby allowing the measurement of the effects of 
consuming activity independent of the effects of ingested volume (Mook, 1963). 
Another way to study the independent effects of consuming behavior is to divide a 
given amount of reward into varying numbers of parts. Thus, consuming activity 
(number of pecks or licks) can vary even though the total volume of ingested 
material is constant (Hall & Kling, 1960; Hulse, 1960). The quality-versus- 
quantity issue can be examined by simultaneously varying the volume of the 
reward (small, medium, large) and the quality of the reward (citric, basic, saccharin)
(Collier, 1962; Hutt, 1954). There have even been some attempts to
explore the complicated effects of variations in quantity and quality of reward in 
human subjects (for example, Weiner, 1966). 

In general, it can be said that all three components (quantity, quality, and 
consuming activity) appear to contribute to the reinforcing effect in a positive 
fashion. At present it is impossible to determine which of the three is the most 
influential, although one guess would be that the quality is most important, 
followed by quantity and consuming activity, in that order. The student interested 
in a detailed discussion of these effects is referred to Kling and Schrier (1971) and 
Osborne (1978). 


Reinforcement Contrast 

An interesting phenomenon has been discovered within the context of studying 
amount of reward. Suppose that you have a summer job picking fruit. You are 
paid 25¢ for each basket you fill. For 3 weeks you work furiously. Then the boss
informs you that the pay rate has been reduced to 12.5¢ a basket. You may well
be tempted to place your basket gracefully over his head and walk off the job. At the
very least your performance will go down. The reinforcement contrast effect
refers to the fact that it may drop below the level at which you would have
performed if you had been receiving 12.5¢ right from the beginning. This is a
negative contrast effect, and it reflects your “depression.” Temporarily, you do
not even pick at the rate you would have maintained had you been receiving the
lower pay rate all along. A positive contrast effect might be demonstrated if your
pay rate were suddenly shifted upward, say, to 37.5¢ a basket. You would be
elated, and your performance might temporarily jump beyond the level at which
it would have been if you had been receiving 37.5¢ all along (Figure 6.8).

Figure 6.7 Precision comes to amount of reward. Even though psychologists may be
confused about the importance of amount of reward, big business is not. In contrast to
earlier times, we are now offered precision-made multiflavored food pellets in many
shapes and sizes. (Reproduced by permission of Bio-Serv Inc.)

Figure 6.8 Idealized positive and negative reinforcement contrast effects. During the
preshift phase, one group receives a large amount of reinforcement, whereas the other
receives a small amount. The amounts given to the two groups are then reversed during
the postshift phase, yielding the contrast effects.

These effects have been demonstrated in the laboratory. They were originally 
observed and described by Crespi (1942) and Zeaman (1949). Zeaman trained 
rats to move down runways for either small or large rewards. The amounts were 
then suddenly shifted. His data appeared to reveal both positive and negative 
contrast effects. Later, Collier and Marx (1959) first reinforced rats with one of 
three concentrations of sucrose solution. All three groups were then trained to 
lever-press for the middle concentration. Rats that were initially reinforced with 
the lower concentration and then shifted to the middle concentration produced 
higher response rates than did the rats trained throughout on the middle concentration
(positive contrast). Rats initially trained on the high concentration and
shifted to the middle concentration displayed lever-pressing rates below those
displayed by the rats trained on the middle concentration throughout (negative 
contrast). 
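
The logic of these comparisons can be stated compactly: a contrast effect is judged against a control group trained throughout on the postshift amount, not against the shifted group's own preshift performance. The Python sketch below is purely illustrative. The function name, its arguments, and the response rates are hypothetical and are not data from Crespi, Zeaman, or Collier and Marx.

```python
def classify_contrast(shifted_rate, control_rate, shift_direction):
    """Label a postshift comparison as positive, negative, or no contrast.

    shifted_rate:    postshift performance of the group whose reward changed
    control_rate:    performance of a group trained throughout on the
                     postshift reward
    shift_direction: "up" if the reward increased, "down" if it decreased
    """
    if shift_direction not in ("up", "down"):
        raise ValueError("shift_direction must be 'up' or 'down'")
    if shift_direction == "down" and shifted_rate < control_rate:
        return "negative contrast"   # undershoots the unshifted control
    if shift_direction == "up" and shifted_rate > control_rate:
        return "positive contrast"   # overshoots the unshifted control
    return "no contrast"


# Made-up lever-pressing rates (responses per minute) for illustration.
print(classify_contrast(12.0, 18.0, "down"))
print(classify_contrast(24.0, 18.0, "up"))
print(classify_contrast(18.0, 18.0, "up"))
```

In practice the comparison would rest on a statistical test between groups rather than on a bare inequality.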

Although the negative contrast effect stands as a viable concept
in the field (Peters & McHose, 1974), there have been questions about the validity
of the positive contrast effect. Beginning with Spence (1956) and continuing into
more recent years (Dunham, 1968; Dunham & Kilps, 1969; Hulse, 1973), a
number of investigators have failed to obtain a positive effect or have discredited 
apparent positive effects. For example, Hulse (1973) first rewarded rats using a 
random arrangement of 1 and 10 pellets. Half of the rats then learned to 
lever-press for 1 pellet, whereas the remaining half learned the same response for 
10 pellets. When rats learned to lever-press for 1 pellet following the mixed 
condition, they learned more slowly than if they had received initial learning with 
just 1 pellet. But in contrast to earlier reports the corresponding positive effect 
was not obtained. That is, rats that were shifted to 10 pellets did not exceed the 
performance level that they would have attained had they been trained on 10 
pellets throughout. Whether this and similar failures to find the positive contrast 
effect result from a true lack of the effect or from methodological difficulties remains
to be seen. 


SECONDARY REINFORCEMENT 

As we have noted, a secondary reinforcer is a neutral stimulus that has, through 
repeated associations with a primary reinforcer, acquired reinforcing properties. 
In a typical animal experiment rats might first be trained to run down a straight 
alley to a black goal box containing food. Once the running response was well 
established, the animals would be removed from the alley and placed in a T maze. 
One arm of the T maze would be black, and the other would be white. No food 
would ever be presented in the T maze. The rat would be allowed to run from the 
starting box and choose one of the arms. In this situation rats typically learn to 
choose the black arm even though food is never presented. Through association 
with the food the black color has become a secondary reinforcer, and it reinforces 
the turning response in the T maze. 

Psychologists have devised a number of procedures for detecting the existence 
and strength of secondary reinforcers. The method just described is called a 
choice test. The animal is allowed to choose between the secondary reinforcer and 
some neutral stimulus configuration (Saltzman, 1949). In a typical extinction test 
(Bugelski, 1938; Melching, 1954; Skinner, 1938), the animal may first be trained 
to obtain food by bar pressing. Each time the bar is pressed, the releasing 
mechanism yields a distinctive clicking sound. This sound becomes a secondary 
reinforcer. Following initial training, extinction is carried out under two condi¬ 
tions. In one, the mechanism is adjusted so that the bar press yields the clicking 
sound but no food. In the other, a bar press produces neither the click nor the 
food. In this situation resistance to extinction tends to be greater when the bar 
press produces the click, demonstrating that the click does, in fact, maintain the 
response. Additional methods are discussed by Gollub (1977). 

Parameters 

Intuitively, it seems possible that a neutral stimulus will become a stronger 
secondary reinforcer if it is paired with a larger amount of primary reward. A $5 
bill is of greater value than a $1 bill, presumably because it is associated with 
more food, comfort, excitement, and so on. The experimental data indicate that, 
if a subject learns to associate one particular stimulus with a large amount of 
reward and another distinct stimulus with a smaller amount, then the stimulus 
associated with the larger amount will become a stronger reinforcer (D’Amato,
1955; Fantino & Logan, 1979; Reynolds, Pavlik, & Goldstein, 1964). This
finding supports our intuitive notion. 

It also seems reasonable to assume that the strength of a secondary reinforcer 
will increase as the number of times it is paired with the primary reinforcer 
increases. The available data (for example, Bersh, 1951; Fantino & Herrnstein,
1968; Miles, 1956) suggest that this is the case, although the effect seems to be 
less than overwhelming. There is only a slight increase in the strength of a 
secondary reinforcer as the number of pairings with the primary reinforcer 
increases. 

A number of investigators have been concerned with the relationship between 
duration of deprivation, or drive level, and the strength of secondary reinforcers.
Interestingly, the conclusion we can draw from the results of a sizable amount of 
experimentation is that this variable is not particularly important in determining 
the strength of a secondary reinforcer. Secondary reinforcers acquired under 
conditions of high drive are no stronger than secondary reinforcers acquired 
under conditions of low drive (see Brown, 1956; Hall, 1951). 

Finally, the influences of schedules of reinforcement on the strength and dura¬ 
bility of secondary reinforcers have been examined. Two conclusions can be 
reached. First, intermittent pairing of the primary reinforcer with the secondary 
reinforcer seems to result in a stronger secondary-reinforcement effect (Armus & 
Garlich, 1961; Kendall, 1974). It is as if a $5 bill will become more valuable, and a
stronger secondary reinforcer, if it is not always associated with primary rewards.
Second, secondary reinforcers become even stronger if the intermittent presenta¬ 
tion of a primary reward described above is variable (Davison, 1972; Hursh & 
Fantino, 1973). That is, if a primary reward is presented in a fixed manner, the 
strength of a secondary reinforcer will be less than that obtained if the primary 
reward is presented in a variable manner. A $5 bill will become more valuable if it 
is associated with a primary reward in some variable, intermittent pattern. 

Some Theoretical Issues 

IS CLASSICAL CONDITIONING INVOLVED? Many psychologists feel that secondary 
reinforcers are established through a process of classical conditioning. If a neutral 
stimulus is repeatedly paired with a primary reward(s), then the animal’s reaction 
to the primary reward becomes classically conditioned to the neutral stimulus. 

This mechanism may not be all there is to the establishment of secondary 
reinforcers; there are other theories (see Fantino, 1977; Rachlin, 1976). And yet 
the classical-conditioning interpretation does receive some support in the litera¬ 
ture. Bersh (1951) gave rats a pellet of food 0, .5, 2, 4, or 10 seconds after the 
onset of a light (see Figure 6.9). Following this phase all rats bar-pressed for the 
light alone. Bersh found that the secondary reinforcer was
strongest when the CS-UCS interval (time between light and pellet) was approximately
.5 of a second. Longer and shorter intervals resulted in weaker secondary
reinforcers. The superiority of the .5-second interval corresponds with the known
superiority of the .5-second CS-UCS interval in general classical-conditioning
situations. 

Figure 6.9 Secondary reinforcer strength as a function of the CS-UCS interval. (Adapted
from Bersh, P. J. The influence of two variables upon the establishment of a secondary
reinforcer for operant responses. Journal of Experimental Psychology, 1951, 41, 62-73.
Fig. 1, p. 66. Copyright 1951 by the American Psychological Association. Reprinted by
permission.)

SECONDARY REINFORCERS AS CARRIERS OF INFORMATION. Egger and Miller
(1962, 1963) came up with the interesting notion that merely pairing a neutral
stimulus with a primary reward is not sufficient for the neutral stimulus to
become a powerful secondary reinforcer. To become powerful the neutral stimu¬ 
lus must convey information about the forthcoming appearance of the primary 
reinforcer. If the neutral stimulus provides the animal with no additional informa¬ 
tion about the upcoming primary reward, then it will be a less effective secondary 
reinforcer. This hypothesis is an extension of an earlier one, sometimes called the 
discrimination hypothesis (Marx & Knarr, 1963; Schoenfeld, Antonitis, &
Bersh, 1950). 

In one condition of their 1962 experiment Egger and Miller presented two 
neutral stimuli (light and tone) before the presentation of food. One of the stimuli 
(say, the light) was switched on first. One-half second later the tone began. Both 
stimuli would then remain on for 1.5 seconds, at which time the food was 
presented. In this situation the light informed the animal that food was forthcom¬ 
ing. The subsequent beginning of the tone, .5 of a second later, provided the 
animal with little or no new information (it already knew that the food was 
coming). In agreement with the hypothesis the light became a much stronger 
secondary reinforcer than the tone. 

The notion that mere pairing of a neutral stimulus and a primary reward will 
result in the development of a secondary reinforcer is further weakened by the 
recent information interpretations of classical conditioning. You will recall from 
Chapter 2 that Rescorla and Wagner (1972) have shown that classical condition¬ 
ing will not occur if the UCS is as likely to occur in the absence of the CS as it is to 
occur in the presence of the CS. Mere pairing provides only a weak source of
conditioning, not only in typical classical-conditioning situations but also in those
involving the establishment of a secondary reinforcer (Rose & Fantino, 1978; 
Squires, Norborg, & Fantino, 1975). The CS must provide information about the 
probability of the forthcoming UCS if that CS is to become effective. 
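
The contingency idea in the preceding paragraph can be made concrete with a little arithmetic. The following is only a minimal sketch: the function names, the tolerance value, and the example probabilities are assumptions introduced for illustration and do not come from the studies cited. It compares the probability of the UCS in the presence of the CS with its probability in the absence of the CS.

```python
def contingency(p_ucs_given_cs: float, p_ucs_given_no_cs: float) -> float:
    """Return P(UCS | CS) minus P(UCS | no CS)."""
    for p in (p_ucs_given_cs, p_ucs_given_no_cs):
        if not 0.0 <= p <= 1.0:
            raise ValueError("probabilities must lie between 0 and 1")
    return p_ucs_given_cs - p_ucs_given_no_cs


def cs_is_informative(p_ucs_given_cs: float, p_ucs_given_no_cs: float,
                      tolerance: float = 1e-9) -> bool:
    """True when the UCS is more likely with the CS than without it.

    The tolerance is an arbitrary illustrative cutoff, not an empirical value.
    """
    return contingency(p_ucs_given_cs, p_ucs_given_no_cs) > tolerance


# Mere pairing: food follows the light half the time,
# but it is just as likely to arrive when the light is off.
print(cs_is_informative(0.5, 0.5))  # False

# Informative CS: food is much more likely when the light is on.
print(cs_is_informative(0.5, 0.1))  # True
```

In both examples the CS is paired with the UCS equally often; only the second arrangement gives the CS any predictive value, which is the distinction the information interpretation draws.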

DO SECONDARY REINFORCERS HAVE MOTIVATING PROPERTIES? In addition to 
their reinforcing properties secondary reinforcers may have motivating prop¬ 
erties. For example, a $5 bill may not only be a reward; it may also activate us, 
motivate us, or urge us toward action. If you were walking calmly down the street 
and suddenly spied a $5 bill on the sidewalk, your behavior would probably 
change rapidly, taking on an extremely motivated quality; the sight of the bill 
would urge you to action. Although this is an old issue (see Dinsmoor, 1950), 
Williams (1970) has provided some more recent evidence that supports the notion 
that secondary reinforcers motivate as well as reinforce. Rats were trained to run 
down an alley for food. Presentation of the food was accompanied by a tone (the 
secondary reinforcer). During testing all animals were confined in the starting box 
for 20 seconds. During this delay the tone was either not sounded or was sounded 
for an interval of .5, 5, or 15 seconds. The tone was terminated by the opening of 
the door. Williams found that, the longer the animals were subjected to the tone 
in the starting box, the faster they left the box once the door was open. The tone 
seemed to have energized, or motivated, the animals. 

SECONDARY REINFORCERS AS UNDETECTED PRIMARY REINFORCERS. A sneaking 
suspicion that creeps into secondary-reinforcement experiments is that some 
supposedly neutral stimuli may actually be primary reinforcers and, hence, not 
neutral at all. An enormous body of literature documents the fact that humans 
and other animals will respond to and seek out novel and changing stimuli. For 
example, animals will easily respond for nothing more than a change in illumina¬ 
tion (Glow & Winefield, 1978; McCall, 1966). Human infants as young as 3 
months can be trained to make a variety of responses when rewarded with 
various visual and auditory stimuli (Siqueland, 1970). Such “neutral” stimuli as
light and sound onset and offset, changed visual patterns, novel odors, and the 
opportunity to explore visually the surrounding environment can be shown to 
have reinforcing properties (Berlyne, 1969; Butler, 1965; Fowler, 1965). Many 
secondary-reinforcement experiments employ such “neutral” stimuli. Adequate 
controls are needed to ensure that they are, in fact, neutral and not reinforcing in 
and of themselves. In choosing neutral stimuli experimenters must be aware of 
this type of possible confounding. 

Tokens and Praise 

It almost goes without saying that secondary reinforcers operate in the human 
realm. They may be very concrete or quite abstract. As we learned in Chapter 2, 
tokens can serve as secondary reinforcers. Nelson and Cone (1979) provide a 
good example of the power of tokens as secondary reinforcers in the treatment of 
psychiatric inpatients. They isolated eight behaviors and first observed the fre¬ 
quency with which these behaviors occurred without any special attention given 
to them. The results of these initial observations are the baseline data in Figure 
6.10. Then they began to reward these behaviors with tokens. Each time one of
the behaviors was observed, the patient received a token that could later be
exchanged for goods and supplies at the hospital store. As the implementation
data in Figure 6.10 indicate, token, or secondary, reinforcement was effective in
increasing all eight desired behaviors in this group of patients.

Figure 6.10 Tokens serve as secondary reinforcers in a token economy for psychiatric
inpatients. During baseline periods no tokens were distributed. During implementation
periods the desired behaviors earned tokens. (Adapted from Nelson, G. L., and Cone,
J. D. Multiple-baseline analysis of a token economy for psychiatric inpatients. Journal of
Applied Behavior Analysis, 1979, 12, 255-271. Figs. 1 and 2, pp. 264-265.)

On a less physical level Skinner (1953) has suggested that social reinforcers 
such as attention, approval, and affection may actually be secondary reinforcers. 
For example, such verbal gestures as “fine,” “good,” “excellent,” and so on may 
represent secondary reinforcers. In addition, confirmation of correct responding, 
or positive feedback, may be a source of secondary reinforcement (Fitts & Posner, 
1967). 


THEORIES OF REINFORCEMENT 

Thus far we have been concerned with observable reinforcement effects resulting 
from the manipulation of a number of critical independent variables. We now 
turn to a consideration of some of the analytical thinking that psychologists have
done with respect to the reinforcement issue. Why do reinforcers reinforce? What 
are some of the major theoretical interpretations of reinforcement? We shall 
review a few of the more important conceptions, drawing on a taxonomy sug¬ 
gested by Tapp (1969). 


Drive-Reduction Theory 

Drive-reduction theory, as exemplified by Hull’s system, has already been dis¬ 
cussed. According to this approach, reinforcement is equivalent to drive reduc¬ 
tion. An animal is in a high drive state if it is experiencing some homeostatic 
imbalance, some internal physiological need, or some type of strong unpleasant 
stimulation. If the animal makes a response that reduces the noxious stimulation 
caused by these sorts of factors, then that response is strengthened. Reinforcement
refers to the diminution of noxious drive stimuli. Animals will acquire responses 
that have been instrumental in reducing drive stimuli (see Houston, 1985). 

The drive-reduction conception appears to be adequate when one considers 
learning related to basic biological drives such as hunger and thirst. When 
deprived of food or liquid, the animal experiences a state of tension. Responses 
that reduce these tension states are learned. If the basic biological drives were all 
that we had to consider, then the drive-reduction conception would seem to be a 
comprehensive theory. The difficulty lies in the fact that the past 30 years have 
seen an explosion of research concerned with rewards that do not reduce any 
known, or apparent, tissue need. Hundreds of studies have shown that humans 
and other animals will seek novel, changing, and complex stimuli and that the 
receipt of these stimuli will be reinforcing. For example, Butler (1953) has shown 
that monkeys will learn visual discriminations for nothing more than the oppor¬ 
tunity to look about a typical laboratory for a few moments. What noxious 
biological drive state is reduced by looking about a laboratory? Opponents of the 
drive conception contend that all sorts of curiosity and exploratory behaviors 
cannot be related to any known drive state. 

The drive-reduction theoreticians respond bravely by postulating new drives to 
account for these sensory-seeking behaviors. For example, Montgomery (1955) 
and others have spoken of an exploratory drive that parallels the more basic 
drives. Fowler (1965) and others have proposed a boredom drive to account for 
certain sensory-seeking behaviors. Harlow (1953) has postulated a manipulation 
drive. An activity drive has been suggested by several authors (see Cofer & 
Appley, 1964). These are but some of the new drives proposed; the list grows 
longer. 

It is too early to determine whether the postulation of these drives is justified. 
But strong criticism has been directed toward the business of inventing new drives 
every time a new stimulus is found to be reinforcing. The critics of the drive- 
reduction position feel that there must be a simpler way to account for these 
diverse behaviors. To fit them all into the drive-reduction mold requires, for 
some, too much bending and tailoring. They feel that the drive reductionists are 
stretching their point. The drive-reduction notion might account for a good deal 
of behavior, but why must it account for all reinforcing events? Perhaps there are 
other reinforcement mechanisms that can supplement, and not necessarily replace,
the drive-reduction conception.

Optimal-Arousal-Level Theories

One alternative to the drive-reduction approach can be labeled the optimal- 
arousal-level conception. A number of psychologists (for example, Berlyne, 1969; 
Fiske & Maddi, 1961; Routtenberg, 1968) have noted that reinforcement does not 
always seem to involve a reduction in stimulation, as implied by the drive- 
reduction position. It would appear that increases in tension, arousal, or stimula¬ 
tion sometimes seem to be reinforcing. Accordingly, optimal-arousal-level 
theoreticians argue that there is some preferred level of arousal that the animal 
seeks to maintain. On the one hand, when arousal becomes too great, any 
response that reduces arousal will be reinforced. When we are hungry, responses 
that are instrumental in reducing the hunger will be reinforced. When we have 
had enough of a noisy party, escape from that party will be a reinforcing event. 
On the other hand, if arousal falls below a certain level, then any response that 
increases the arousal level will be reinforced. For instance, if you had been sitting 
around your apartment for days with nothing to do, you would probably engage 
in behavior that would increase your level of arousal or stimulation. Any move¬ 
ment toward the optimal level, whether an increase or decrease, would be 
reinforcing. 

This type of theory is appealing, because it fits the fact that animals sometimes 
seek to increase, and sometimes to decrease, stimulation. Its principal shortcom¬ 
ing lies in the fact that it is difficult to quantify, or define, exactly what is meant by 
arousal. The arousal concept seems similar to the drive concept in that each refers 
to a state of tension. The principal difference between them is that drive reduc- 
tionism argues that the animal strives to attain a zero drive level, whereas the 
optimal-level theory argues that the animal attempts to maintain some preferred 
level of arousal greater than zero. 
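
The contrast between the two positions can be sketched in a few lines of code. This is only an illustration: arousal is treated as a single number in arbitrary units, and the function name, the preferred level of 5, and the example values are all hypothetical. As the passage notes, arousal is difficult to quantify, so nothing here should be read as a measurement.

```python
def is_reinforcing(arousal_before: float, arousal_after: float,
                   preferred_level: float) -> bool:
    """True if the change moves arousal closer to the preferred level."""
    distance_before = abs(arousal_before - preferred_level)
    distance_after = abs(arousal_after - preferred_level)
    return distance_after < distance_before


DRIVE_REDUCTION_TARGET = 0.0  # drive reduction: the animal strives toward zero
OPTIMAL_LEVEL = 5.0           # hypothetical preferred arousal, arbitrary units

# Leaving a noisy party: arousal falls from 9 to 6.
print(is_reinforcing(9, 6, OPTIMAL_LEVEL))           # True
print(is_reinforcing(9, 6, DRIVE_REDUCTION_TARGET))  # True

# Seeking stimulation after days of boredom: arousal rises from 1 to 4.
print(is_reinforcing(1, 4, OPTIMAL_LEVEL))           # True
print(is_reinforcing(1, 4, DRIVE_REDUCTION_TARGET))  # False
```

Under these assumptions the two views agree whenever arousal is above the preferred level and part ways only when arousal is below it, which is where sensory-seeking behavior becomes a problem for drive reduction.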

Stimulus Theories 

The drive-reduction and optimal-arousal-level theories emphasize some internal 
state that the animal attempts to change or maintain. A number of theories differ 
from these two in that they focus on the impact of external stimuli rather than on 
the internal state of the organism. There are several varieties. Some of them are 
quite distinct, whereas others overlap with one another or with the conceptions 
just outlined. Here are three examples.

STIMULUS-CHANGE THEORIES. McCall (1966) has argued that changes in level of 
stimulation will be reinforcing. Up to a point, the greater the change, the greater is 
the reinforcing effect. McCall has been able to demonstrate that rats will work to 
obtain changes in level of illumination. The emphasis here is on a change in 
quantity rather than in quality. Although perhaps somewhat limited, the theory 
does have heuristic value. It is easy for us to imagine ourselves acting in a manner 
that will bring about stimulus change. 

PREFERRED-LEVEL THEORIES. In contrast to McCall’s theory, Lockhard (1966) 
suggests that the animal seeks a preferred level of stimulation rather than a simple 
change in stimulation. Any change that is in the direction of the preferred level 
will be reinforcing. Any change that is away from the preferred level will be 
punishing. The similarity between this theory and the optimal-arousal-level
theories is obvious. The primary difference between them is that one focuses on
internal states, whereas the other attends to external stimulation. 

STIMULUS-QUALITY THEORIES. Whereas the stimulus-change and the preferred- 
level theories focus on the amount of stimulation, Pfaffmann (1969), P. T. Young 
(1966), Sheffield (1966), and others have directed their attention toward the 
quality of the stimulus. In general, their theories are concerned with the fact that 
certain stimuli (for example, sweet ones) are genetically preferred. The presenta¬ 
tion of these stimuli is assumed to be reinforcing in some predetermined fashion, 
regardless of the arousal state of the animal. Thus, an animal will work to obtain 
a saccharin solution, even though the solution has no nutritional value. The 
animal merely likes the flavor and will work for it. 


Response Theories 

THE PREMACK PRINCIPLE. Response theories of reinforcement emphasize the
fact that most reinforcing events involve some associated consummatory response 
(for example, chewing, manipulating, drinking, swallowing, copulating). The 
making of the response itself may be an important part of the reinforcing event. 
The emphasis in these theories is on the rewarding properties of consummatory 
responses. For example, we all like to chew food and to swallow it. These acts can 
be reinforcing. Consummatory theories do not focus on any tension-reducing 
events. 

Although consummatory theories have taken several forms (for example, 
Glickman & Schiff, 1967; Sheffield, 1966), one of the more interesting variations 
is that proposed by Premack (1965, 1971). According to Premack, one response 
can be used to reinforce another. Given two responses, one of which is more likely 
to occur than the other, the more likely one can reinforce the occurrence of the 
less likely one. For example, Premack (1959) found that, if children would rather 
eat candy than play a game, it is possible to increase their game playing by 
reinforcing it with candy eating. Similarly, if children prefer to play a game, their 
candy-eating behavior can be increased by reinforcing it with game playing. 
Parents often hear their children saying things like “I’ll play checkers with you if 
you’ll play cards with me.” (What appears to be a marvelous example of coopera¬ 
tion may then, of course, be scuttled by the impossible question: Which game is to 
be played first?) 
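
Premack's rule lends itself to a simple decision procedure. The sketch below assumes that each response has a measurable baseline likelihood; the function name and the numerical values are hypothetical and are not taken from Premack's data.

```python
def premack_reinforcer(baseline: dict[str, float]) -> tuple[str, str]:
    """Return (reinforcing_response, reinforced_response) for two responses.

    `baseline` maps each response to how likely it is when freely available.
    The more likely response is the one that can reinforce the less likely one.
    """
    if len(baseline) != 2:
        raise ValueError("expected exactly two responses")
    (first, p_first), (second, p_second) = baseline.items()
    if p_first == p_second:
        raise ValueError("no prediction when the responses are equally likely")
    if p_first > p_second:
        return first, second
    return second, first


# A child who would rather eat candy than play a game.
print(premack_reinforcer({"eating candy": 0.7, "playing a game": 0.3}))
# ('eating candy', 'playing a game')

# A child with the opposite preference: the roles reverse.
print(premack_reinforcer({"eating candy": 0.2, "playing a game": 0.8}))
# ('playing a game', 'eating candy')
```

The same two activities appear in both examples. What changes is only their relative likelihood, and with it which activity can serve as the reinforcer.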

RESPONSE-DEPRIVATION HYPOTHESIS. Timberlake and Allison (1974) have out¬ 
lined another related interpretation of reinforcement (see also Timberlake, 1980). 
Quite simply, if we deprive an organism of the opportunity to make a response as 
often as it normally does, that response will become a reinforcer. Suppose a young 
child spends a lot of time riding his bike. One day his mother says, “Jimmy, if you 
pick up your toys, I’ll let you ride your bike.” Jimmy may think, “No. I get to ride
all the time anyway.” Bike riding will not be reinforcing. But suppose that before 
making her offer the mother first deprives Jimmy of the opportunity to ride for 
two days. Then, when she says he can ride if he picks up his toys, he will jump to 
the task. The response of bike riding has become reinforcing because Jimmy has 
been deprived of the activity.

Which Theory Is Correct?

A number of investigators have discovered that a rat, given the choice between 
eating free, available food or pressing a bar to receive food, will often choose to 
“work” for its food (see Osborne [1977], for a review). The rat will press the bar 
even though it does not need to because “free food” is always available. 

Although the effect itself is intriguing, it is also an instructive example. It can 
help us see how the many theories of reinforcement that we have been discussing 
can all interpret the same set of data. For example, when using stimulus theories 
one has no trouble interpreting the free-food effect. One merely argues that the 
rat presses the bar because it is seeking stimulus change, or some preferred level of 
stimulation, or some specific quality of stimulation. In optimal-arousal-level 
terms it would be argued that bar pressing is maintained because it stirs things up 
a bit and moves the animal into its preferred level of arousal. Response theorists 
would argue that the “consumption” of bar-press responses is reinforcing. And 
drive theorists would suggest that bar pressing reduces an activity drive. 

The problem is evident. If all the theories can predict the same events, it is 
difficult to choose among them. What we must await are a number of accurate 
predictions that can be made only on the basis of one of the theories. Then we 
shall know that we have the most accurate theory. 

THE PHYSICAL BASIS OF REINFORCEMENT 

Theorizing about reinforcement on a behavioral level is an ongoing process (see 
Bindra, 1974; Klinger, 1975; Rachlin & Burkhard, 1978; Solomon & Corbit,
1974). In addition, the relationships between events in the brain and external 
reinforcing events are now under intensive study. Psychologists, in what promises 
to be an exciting endeavor, are beginning to explore the chemical and electrical 
nature of reinforcement. 


Electrical Brain Stimulation 

One line of research underscores the electrical nature of the brain. These studies 
have to do with the rewarding and punishing effects associated with the electrical 
stimulation of various parts of the brain (see Olds & Milner, 1954; Valenstein, 
1973). In a typical experiment an extremely small electrode is permanently 
implanted in the brain of the subject (a rat, cat, or the like). The tip of the 
electrode is directed toward the particular portion of the brain of interest to the 
experimenter. (Interestingly enough, the experimenter is never quite sure where 
the tip ends up until after the experiment is completed, the animal is sacrificed, 
and its brain is sectioned, mounted on slides, and examined.) Once the electrode 
is in place, a small electrical current can be delivered through the tip to the 
surrounding area. This little shock stimulates the brain cells in the immediate area 
and causes them to fire. 

The experimenter can arrange the situation so that, when the animal presses a 
bar, it will receive a certain amount of brain stimulation. In other words, the rat 
itself determines whether it will receive electrical brain stimulation. If the animal 
learns to press the bar repeatedly, then we suspect that the area in which the 
electrode is implanted is a reward, or reinforcing, or “go” center. If the animal 
refrains from pressing the bar, we assume that the area is either neutral or 
punishing. If the animal will learn a particular response that terminates brain
stimulation, we assume that this particular area is punishing. By implanting
electrodes in various parts of the brain and testing the rat’s tendency to maximize 
or minimize stimulation, we are able to map the brain in terms of its involvement 
with reinforcement and punishment. As it turns out, there are many reinforcing 
and punishing areas throughout portions of the brain lying below the cortex, in 
particular, in the area of the hypothalamus. 
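
The inference rules used in this kind of brain mapping can be written out explicitly. The sketch below simply restates the three rules given above; the names `SiteType` and `classify_site` are hypothetical, and the case of an animal that both presses for stimulation and learns to terminate it is left undefined because the rules as stated do not address it.

```python
from enum import Enum


class SiteType(Enum):
    REWARDING = "rewarding"
    PUNISHING = "punishing"
    NEUTRAL_OR_PUNISHING = "neutral or punishing"


def classify_site(presses_for_stimulation: bool,
                  learns_to_terminate: bool) -> SiteType:
    """Classify an electrode site from two behavioral observations."""
    if presses_for_stimulation and learns_to_terminate:
        # The rules summarized in the text do not cover this combination.
        raise ValueError("combination not covered by the stated rules")
    if learns_to_terminate:
        return SiteType.PUNISHING
    if presses_for_stimulation:
        return SiteType.REWARDING
    return SiteType.NEUTRAL_OR_PUNISHING


print(classify_site(True, False).value)   # rewarding
print(classify_site(False, True).value)   # punishing
print(classify_site(False, False).value)  # neutral or punishing
```

Note that refraining from bar pressing is ambiguous on its own. Only the additional test, in which the animal can learn a response that turns the stimulation off, separates a punishing site from a neutral one.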

Studies of reinforcing areas have yielded some striking results. An animal 
pressing a bar for reinforcing brain stimulation just does not seem to want to 
stop. If we reinforce a hungry rat with food, it will, sooner or later, satiate and 
stop responding. But the animal rewarded with brain stimulation will press the 
bar for hours on end, sometimes stopping only when it drops from fatigue. 
Another interesting finding is that, once the electricity is turned off, the behavior 
(such as bar pressing) ceases almost immediately. This rapid cessation does not 
correspond to “normal” learning situations, in which most responses extinguish 
quite slowly once reinforcement is removed. 

Exactly what the animal is experiencing during brain stimulation is uncertain. 
It is often assumed that the stimulation of reinforcement centers produces positive 
affect, or, in some sense, pleasant sensations. This interpretation is supported by 
studies in which humans undergoing operations report stimulation of their brain 
to be joyful or relaxing (Heath & Mickle, 1960). But there are other ways to think
about the situation. For example, Milner (1970) refers to response hold and 
response change neural systems, which determine whether the animal continues 
what it has been doing or changes to some new behavior. In other words, 
stimulation of a so-called reinforcement center might not necessarily be pleasant. 
It might merely initiate neural instructions to repeat the last response. Although it 
is convenient to think of reward centers and punishment centers, we should keep
in mind that these conceptions are hypotheses about what is going on and not 
clearly established facts. 

In summary, electrical-stimulation techniques have generated a good deal of 
theory and research. At the same time, it should be recognized that the method is 
in its infancy (see Livesey & Rankine-Wilson, 1975). Stimulation techniques are 
generating interesting ideas about how the brain is organized, but at present these 
ideas are highly speculative. (Although the ideas may have crept into a science 
fiction story or two, we are a long way from “controlling” one another through 
the use of electrical techniques, and it is highly unlikely that we will all soon be 
pressing little buttons that automatically stimulate our “pleasure” centers.) 

Production of Endorphins 

Research into the physical basis of reinforcement continues on other levels too. 
For example, we have already seen in Chapter 2 that the brain produces opiate¬ 
like pain-reducing substances such as endorphins when pain becomes too great 
(Lewis & Liebeskind, 1983). Because pain can be involved in reinforcement (such
as when it is removed), these new and intriguing research efforts are really 
exploring the very essence of reinforcement on a physical level. 

Synaptic Transmission 

Finally, a great deal of attention is being given to the study of the chemical and 
electrical events that occur between neurons, or within the synapse. For example,
Deutsch (1968, 1971) and Deutsch and Deutsch (1973) have proposed that
learning involves the enhancement of synaptic transmission through the production
of more efficient concentrations of acetylcholine (ACh). Figure 6.11 is a
simplified version of what may be involved in the transmission of an impulse
across a cholinergic synapse. An electrical impulse is presumed to travel along the
presynaptic terminal. This impulse, on reaching the end of the presynaptic cell, is
presumed to stimulate the release of ACh from a set of vesicles. These bits of ACh
drift across the synaptic gap and are taken up by ACh receptors located on the
postsynaptic terminal, or neuron. If enough ACh is taken up by these receptors,
an electrical impulse is presumed to be triggered on the postsynaptic side. This
impulse then travels on down the postsynaptic neuron to the next synapse. In
other words, Deutsch and his colleagues suggest a chemical mechanism mediates
the transfer of an electrical impulse across the synaptic gap.

Figure 6.11 Illustration of biochemical events involved in a cholinergic synapse. (Adapted
from Gordon H. Bower and Ernest R. Hilgard, Theories of Learning, © 1981, p. 486.
Reprinted by permission of Prentice-Hall, Inc., Englewood Cliffs, New Jersey.)

Another substance, cholinesterase, is presumed to be present in the area of the 
synaptic gap. This cholinesterase performs a critical function in the transmission 
process. Specifically, it neutralizes, or “devours,” the ACh soon after it is released 
by the presynaptic terminal. If the ACh were not neutralized, then the postsynap¬ 
tic terminal would become “jammed” with ACh and could not fire again. For the 
postsynaptic cell to fire again, the old ACh must be neutralized, and new ACh 
must arrive. Hence, the cholinesterase “clears away” old ACh so that the trans¬ 
mission mechanism can operate again. 
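
The logic of release, uptake, and clearance can be followed step by step in a toy simulation. This is a deliberately crude sketch and not a physiological model: time is divided into discrete steps, and the release amount, firing threshold, and "jam" level are arbitrary numbers chosen only to make the verbal account runnable. All names and constants are hypothetical.

```python
RELEASE_AMOUNT = 1.0  # ACh released per presynaptic impulse (arbitrary units)
THRESHOLD = 1.0       # ACh needed at the receptors to trigger an impulse
JAM_LEVEL = 0.1       # leftover ACh above this level blocks further firing


def simulate_synapse(presynaptic_impulses: list[bool],
                     clearance_fraction: float) -> list[bool]:
    """Return, for each time step, whether the postsynaptic cell fires.

    `clearance_fraction` is the share of bound ACh that cholinesterase
    neutralizes after each step (1.0 = all of it, 0.0 = none).
    """
    leftover = 0.0
    fired = []
    for impulse in presynaptic_impulses:
        jammed = leftover > JAM_LEVEL  # old ACh still occupies the receptors
        bound = leftover + (RELEASE_AMOUNT if impulse else 0.0)
        fired.append(impulse and not jammed and bound >= THRESHOLD)
        leftover = bound * (1.0 - clearance_fraction)
    return fired


impulses = [True, True, True]
print(simulate_synapse(impulses, clearance_fraction=1.0))  # [True, True, True]
print(simulate_synapse(impulses, clearance_fraction=0.0))  # [True, False, False]
```

With full clearance every presynaptic impulse is passed along. With no clearance the first impulse gets through, after which the receptors remain occupied and transmission stops, which is the "jamming" described above.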

Although in this discussion we emphasize ACh because of its importance, you 
should be aware that many other neurotransmitters are under investigation and 
that some of these inhibit rather than facilitate impulse propagation. In fact, 10 to 
20 different types of synapse are suspected to exist in mammals, and each is 
thought to involve different transmitter substances. So perhaps the assertion 
made by Deutsch and his colleagues is an oversimplification. 

More recently, Stein and his co-workers (Stein, 1978) have suggested that 
reinforcement and punishment may involve entirely different transmitter 
substances. Specifically, punishment may involve the transmitter called serotonin, 
whereas reward may involve a catecholamine as a transmitter. 

Clearly, these efforts are but scratching the surface of a very complicated area 
of research. In fact, new and exciting findings can be expected in the entire field 
concerned with the physical, electrical, and chemical bases of learning and 
memory. This introduction has been selective; there are many intriguing areas that you 
might explore (for example, electroconvulsive shock and retrograde amnesia). 
Cotman and McGaugh (1980) provide a good summary of recent developments 
in these areas. 


SUMMARY 

1. Beyond simple operational notations there is very little agreement among 
psychologists concerning the nature of reinforcers. It is clear that the 
important parameters of reinforcement include schedules, amount, and delay of 
reinforcement. 

2. Continuous reinforcement (CRF) refers to situations in which every response 
is reinforced. 

3. In a fixed-interval schedule the animal is reinforced for the first response that 
occurs after the end of a given time interval. Long pauses that follow 
reinforcement in the FI schedule produce a characteristic “scalloping” effect 
in cumulative response records. 

4. In variable-interval (VI) schedules the animal is forced to wait for varying 
amounts of time before receiving reinforcement. The VI schedule produces 
steady responding. The longer the average interval, the slower the 
responding. 

5. In the fixed-ratio (FR) schedule the animal must make a certain number of 
responses before it is rewarded. The FR schedule yields high rates of 
responding and a mild postreinforcement pause. The higher the ratio, the longer is 
the postreinforcement pause. 

6. In the variable-ratio (VR) schedule the number of required responses varies. 
This schedule also produces high response rates but normally does not yield 
postreinforcement pauses. 

7. Partial reinforcement schedules sometimes lead to higher response rates than 
does CRF. 

8. Multiple schedules consist of two or more simple schedules presented one 
after the other. Distinctive cues are associated with each simple phase. 
Performance on any given phase may be affected by previous phases. In 
mixed schedules, a variation of the multiple pattern, the distinctive cues are 
not present. 

9. In chained schedules two or more successive simple schedules must be 
completed before a reinforcement can be obtained. Distinctive cues are 
associated with each simple phase. Tandem schedules are like chained 
schedules except that the distinctive cues are not present. 

10. In concurrent schedules the animal is faced with two levers, each of which is 
associated with its own schedule. 

11. In concurrent situations the relative frequency of responding on an alternative 
key matches the relative frequency of reinforcement for responses on that 
alternative. This is the matching effect. 

12. Matching may maximize the number of reinforcements we receive. 

13. Schedule-induced behavior, such as attack, pacing, and jaw clenching, occurs 
during intervals between reinforcements and may be reinforced because it 
reduces tension or increases stimulation. 

14. Schedule-induced behavior first increases and then decreases as the time 
between reinforcements increases. 

15. Autoshaping refers to the fact that animals will sometimes develop a 
response to a cue even though that cue is always followed by reinforcement, 
whether or not the animal responds. 
Autoshaping suggests that classical conditioning is intertwined with what is 
normally called instrumental conditioning. 

16. The partial reinforcement effect refers to the fact that resistance to extinction 
following partial reinforcement is greater than resistance to extinction 
following CRF. 

17. Frustration theory, sequential theory, and the discrimination hypothesis have 
all been proposed as explanations of the PRE. 

18. Humans respond to many schedule manipulations in roughly the same 
manner as do “lower” animals. 

19. If a reward is delayed following the occurrence of a response, then 
performance drops off. If the reward is delayed too long, then learning will not occur 
at all. And yet long delays sometimes appear not to hinder learning in the 
slightest. Successful learning under these conditions has been attributed to 
the presence of undetected secondary reinforcers immediately following the 
response. (A secondary reinforcer is any neutral stimulus that, through 
repeated pairings with a primary reward, has acquired reinforcing properties 
of its own.) 

20. If confounding secondary reinforcers are removed from the 
delay-of-reinforcement experiments, then learning under conditions of delayed primary 
reinforcement drops off dramatically. Some psychologists feel that learning 
cannot occur at all unless some type of reinforcement (primary or secondary) 
is immediate. 

21. Delays in the receipt of primary reinforcement are not particularly effective 
in reducing human performance. Humans are better able to bridge time gaps 
than are other animals. 

22. Performance tends to increase as the amount of reinforcement is increased, 
up to a point. 

23. Three components of amount of reward have been identified. They include 
the volume or mass of the reward, the amount of consuming activity, and the 
quality of the reward. All three have been found to be important in 
determining reinforcement effects. 

24. Negative-reinforcement contrast refers to the fact that if an animal has been 
working for a high level of reward and is suddenly shifted to a lower level, its 
performance will temporarily drop below the level that it would have 
maintained had it been experiencing only the lower level all along. Although the 
negative-reinforcement contrast effect is fairly well established, the 
corresponding positive-reinforcement contrast effect has been difficult to obtain. 

25. The existence and strength of secondary reinforcers are detected by various 
means, including choice tests and extinction tests. 

26. Secondary reinforcers can become stronger if they are paired with larger 
amounts of primary reward. 

27. The strength of a secondary reinforcer increases slightly as the number of 
times it is paired with the primary reward is increased. 

28. Duration of deprivation, or drive level, seems to be relatively unimportant in 
determining the strength of secondary reinforcers. 

29. Schedules of reinforcement appear to have a significant impact on the 
strength of secondary reinforcers. In general, intermittency leads to an 
increase in the durability of secondary reinforcers, as does variability. 

30. Secondary reinforcers may be established through a process of classical 
conditioning. 

31. To become powerful secondary reinforcers, stimuli may have to carry 
nonredundant information about the forthcoming occurrence of the primary 
reward. 

32. Secondary reinforcers may have motivating properties. 

33. Some so-called secondary reinforcers may be nothing more than undetected 
primary reinforcers, in that some so-called neutral stimuli may satisfy the 
animal’s requirements for novel, changing, and complex stimulation. 

34. Human performance is strongly affected by secondary reinforcers such as 
tokens and praise. 

35. Returning to the theoretical controversy, we find that drive-reduction theory 
has difficulty explaining certain sensory-seeking behaviors such as 
exploration. Many psychologists are unwilling to believe that all reinforcing events 
involve the reduction of noxious stimulation. They also object to the endless 
invention of new drives to account for newly discovered reinforcing stimuli. 
The drive-reduction theory is a good one, but it may not be comprehensive. 

36. Alternative theories include the optimal-arousal-level conceptions, which 
assume that the animal seeks to maintain some preferred level of arousal. 

37. There are several stimulus theories of reinforcement. The stimulus-change 
theorists argue that changes in level of stimulation are reinforcing. The 
preferred-level theorists argue that any change away from a preferred level of 
stimulation will be punishing, whereas any change toward the preferred level 
will be reinforcing. 

38. Stimulus-quality theorists contend that certain stimuli (such as sweet ones) 
are innately preferred and that their presentation will be reinforcing. The 
emphasis here is on quality rather than quantity. 

39. Response theories of reinforcement argue that consummatory responses are 
reinforcing. Premack’s variation on this theme holds that, if one of two 
responses is more likely to occur, the occurrence of the less likely one can be 
increased by reinforcing it with the opportunity to perform the more likely 
one. 

40. The response deprivation hypothesis is another example of response theory. 

41. At present, none of these theories is any more valid than the others. Each 
probably has something to contribute to the overall picture of reinforcement, 
and each can account for many of the data, such as the free-food effect. 

42. The physical basis of reinforcement is of interest to many researchers. Areas 
of study include electrical brain stimulation, endorphins produced by the 
brain, and the nature of synaptic transmission. 


PART THREE 

TRANSFER 


It is clear that we rarely, if ever, learn anything entirely new. No matter how 
novel a new learning situation appears, past experiences and habits will be 
brought into play. We now turn to a consideration of transfer, or the effects of 
our prior learning on subsequent learning. Transfer is part and parcel of the 
entire learning process, which also includes acquisition and retention. 

It is almost unnecessary to point out the tremendous importance of transfer in 
our daily life. We never truly learn anything in a vacuum. For example, the mere 
fact that we have acquired language guarantees that many new learning situations 
will be, in effect, transfer tasks. To the extent that the new learning task 
involves language, we are involved in a transfer situation. 

We shall begin our study of transfer by considering the complementary processes 
of generalization and discrimination. Generalization refers to the fact that, if 
a response is connected to one particular stimulus, the subsequent presentation 
of similar stimuli will tend to elicit that same response. Generalization is a 
type of transfer effect in that training with one stimulus affects future 
performance. Discrimination is, in a sense, the opposite side of the same coin. 
It refers to the reducing of generalization through a process of differential 
reinforcement. The range of stimuli that will elicit a particular response is 
decreased. The issues of generalization and discrimination will be discussed in 
terms of both animal and human action. 

We shall also consider transfer of training. In studies on this process subjects 
typically learn two successive verbal tasks, which may vary in their similarity 
to each other. Under some circumstances learning of the second task is 
facilitated by the prior learning. Under other circumstances it is hindered. 
Both empirically determined and theoretically envisioned principles of transfer 
will be outlined. 

In addition, an attempt will be made to point out the ways in which 
generalization and transfer of training are intimately related to each other. 
Both involve the impact of prior learning on subsequent performance. 

Figure 7.1 Idealized stimulus generalization gradients that might be obtained from groups 
of birds trained at different points along a wavelength continuum. 

GENERALIZATION 

As we just saw, stimulus generalization refers to the fact that, if a response is 
learned to one particular stimulus, similar stimuli will tend to elicit that response. 
The greater the similarity, the greater is the tendency. For example, babies may 
say “Dada” to any approaching man, even though they have been reinforced for 
saying “Dada” only in the presence of their father. The more the stranger looks 
like the father, the more likely the baby is to say “Dada.” 

On a more experimental level we should note the widely quoted results 
obtained by Guttman and Kalish (1956). These researchers trained separate groups 
of pigeons to peck an illuminated key, with a different key color for each group. 
Following training the birds were presented with a random sequence of colored 
keys that varied in their similarity to the original training stimulus. Each 
group produced what is known as a 
stimulus generalization gradient. That is, the greatest numbers of responses 
occurred during the presentation of the original training stimuli. As the various 
test stimuli became less similar to the original stimulus, the numbers of evoked 
responses decreased. The birds, originally trained at different points along the 
color continuum, produced relatively symmetrical generalization gradients 
around their particular training stimulus value. Figure 7.1 contains some idealized 
gradients that might be obtained in this sort of experiment. 
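
The shape of such gradients can be made concrete with a small numerical sketch. The Python below assumes, purely for illustration, that responding falls off as a Gaussian function of the distance between the test wavelength and the training wavelength. The function name, the peak of 300 responses, and the 20-nm width are hypothetical choices, not values reported by Guttman and Kalish.

```python
import math

def gradient(train_nm, test_nms, peak=300.0, width_nm=20.0):
    """Return hypothetical response counts for each test wavelength.

    Assumes a symmetrical, Gaussian-shaped falloff around the training
    value; peak and width_nm are illustrative, not empirical, values.
    """
    return [peak * math.exp(-((t - train_nm) ** 2) / (2 * width_nm ** 2))
            for t in test_nms]

# Three hypothetical groups trained at different points on the continuum.
test_nms = [510, 530, 550, 570, 590]
for train_nm in (530, 550, 570):
    counts = gradient(train_nm, test_nms)
    print(train_nm, [round(c) for c in counts])
```

Under these assumptions each group's responding peaks at its own training value and declines symmetrically on either side, which is the pattern the idealized gradients are meant to convey.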

The above example is an instance of the generalization of an instrumentally 
conditioned response. But classically conditioned responses generalize, too. For 
example, Moore (1972) classically conditioned the rabbit’s nictitating membrane 
(one of the rabbit’s three eyelids), using a 1,200-hertz tone as the CS. The UCS 
was a shock delivered to the skin near the eye. After conditioning, wherein the 
1,200-Hz tone came to elicit an eye blink, Moore tested for extinction of the 
conditioned response. He tested for resistance to extinction by presenting not 
only the 1,200-Hz tone but higher and lower tonal frequencies as 
well. As you can see in Figure 7.2, the nictitating-membrane response generalized 
to the new test frequencies. The closer the test stimulus was to the original 
1,200-Hz training tone, the greater was the relative number of conditioned eye 
blinks made during extinction. 

Figure 7.2 Generalization of classically conditioned nictitating-membrane response in the 
rabbit. [Adapted from John W. Moore, “Stimulus Control: Studies of Auditory 
Generalization in Rabbits” in Classical Conditioning II: Current Theory and Research, 
Black/Prokasy (Eds.), © 1972, p. 215. Reprinted by permission of Prentice-Hall, Inc., 
Englewood Cliffs, New Jersey.] 

It is clear that the ability to generalize can be either adaptive or maladaptive. 
On the one hand, it would be inconvenient, to say the least, if we failed to 
recognize or respond to strangers as humans. On the other hand, it would not be 
particularly adaptive if we generalized too much (for example, if we “recognized” 
approaching animals as humans). 

Variables Affecting Generalization 

As is true of many learning phenomena, there are many variables that affect the 
shape and form of the stimulus generalization gradient. We shall discuss a few of 
them. 

EXTENDED TRAINING. What will happen to stimulus generalization as 
reinforced responding to the training stimulus is increased? For example, what will 
happen to the baby’s tendency to say “Dada” to strangers as the number of times 
she is reinforced for saying “Dada” to her father is increased? On an intuitive 
level we might expect an initial increase in stimulus generalization. When the 
baby first receives reinforcement for saying “Dada,” the father might not be all 
that distinct to her. Hence, we might find an initial failure to distinguish between 
the father and strangers. But as training continues, the baby gains experience. The 
father becomes more distinct to her. After considerable training she might be 
much less likely to confuse strangers with her father. Generalization might 
decrease. 

The experimental data pertaining to this issue are equivocal. On the one hand, 
Hearst and Koresko (1968) gave groups of pigeons different numbers of training 
trials on a task where the pecking of a vertical line was reinforced. They then 
tested for generalization by presenting lines placed at various angles away from 
the vertical position. They found that the generalization gradients became steeper 
as the number of training trials increased (see also Hovland, 1937; Razran, 1949). 
But some investigators have found that extended reinforcement on the original 
training stimulus leads either to no change or to an actual flattening of the 
gradient (Margolius, 1955; McCain & Garrett, 1964). The situation appears to be 
a very complex one, and it is not yet fully understood. The interested reader is 
referred to Blough and Blough (1977), Mackintosh (1974, 1977), Steinhauer, 
Davol, and Lee (1977), and Van Houten and Rudolph (1972) for more 
information. 

THE EFFECTS OF PARTIAL REINFORCEMENT. Suppose, when referring to her 
father as “Dada,” our baby is sometimes rewarded (“Gooood baby!”) and 
sometimes not (silence). What effect might this partial reinforcement have on the 
extent to which the response will generalize to other men? 

Much of the experimental evidence suggests that generalization will be more 
extensive following partial reinforcement (see Humphreys, 1939; Wickens, 
Schroder, & Snide, 1954). For example, Hearst, Koresko, and Poppen (1964) 
trained pigeons to peck a key on four different VI schedules. They found that, as 
the intervals increased, the generalization gradients became flatter. In other 
words, generalization increased as the reinforcement became more intermittent. 

However, just as a simple answer eluded us in trying to understand the 
relationship between extended training and generalization, things become pretty 
complicated here, too. For example, Blough (1975) trained pigeons to peck a light 
of a certain wavelength. He used an FI 20 schedule. That is, the pigeons had to 
wait 20 seconds between reinforcements. The interesting thing that Blough did 
was to test for generalization at different points during the 20-second interval (see 
Figure 7.3). Some of the time he presented a new test light toward the beginning 
of the interval, just after a reward had been received for pecking the training light, 
and sometimes he presented a new test light later in the interval, after the bird had 
been waiting for a while. Notice in Figure 7.3 that, if he tested early in the 
interval, the gradients were steep; the birds tended not to peck similarly colored 
lights if they had just been reinforced for pecking the training light. But when the 
test lights were presented later in the interval, the gradients became very flat; the 
pigeon would peck a light at just about any wavelength if it had been waiting. 

Figure 7.3 The effects of partial reinforcement on stimulus generalization as influenced by 
time of testing during an FI interval. The birds were tested for generalization either early 
or later in a 20-second FI interval. Generalization was extreme (flat gradients) when tests 
occurred later and minimal (steep gradients) when testing occurred early. (Adapted from 
Blough, D. S. Steady state data and a quantitative model of operant generalization and 
discrimination. Journal of Experimental Psychology: Animal Behavior Processes, 1975, 
104, 3-21. Copyright 1975 by the American Psychological Association. Reprinted by 
permission.) 

The Role of Experience in Generalization 
Suppose that a baby is kept at home most of the time. He sees very few strangers. 
The only man he knows is his father, and he calls him “Dada.” He has had no 
experience with a dimension of men. Suppose that we now expose the baby to a 
variety of strange men. What kind of generalization will occur? Will he generalize 
more or less than a baby who has been exposed to many different men all along? 
There have been two general positions concerning this issue. Lashley and Wade 
(1946), on the one hand, argue that the inexperienced baby will respond equally 
to all men. The stimulus generalization gradient will be flat. They argue that the 
stimulus dimension (for example, a dimension of men varying in similarity to one 
another) does not “exist” for the baby until he has had experience with it. Until 
the baby has had a chance to distinguish among, or compare, men, all men will be 
equivalent. All men, including the father, will be perceived as the same and will be 
responded to identically. On the other hand, Hull (1943) and Spence (1937) 
would argue that the baby will be less likely to respond to less similar men even 
though he has never had the opportunity to experience various points along the 
dimension. They would argue that the gradient generated by our inexperienced 
baby would be steep, and not flat. 
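
The contrast between the two predictions can be expressed numerically. The following Python sketch converts raw response counts into a relative gradient and then computes a simple steepness index. Both the index and the response counts are hypothetical devices introduced here for illustration; they are not measures or data taken from Lashley and Wade, Hull, or Spence.

```python
def relative_gradient(counts):
    """Convert raw response counts into proportions of total responding."""
    total = sum(counts)
    return [c / total for c in counts]

def steepness(counts, train_index):
    """Proportion of responses at the training stimulus minus the mean
    proportion at the other test stimuli (near 0 for a flat gradient)."""
    props = relative_gradient(counts)
    others = [p for i, p in enumerate(props) if i != train_index]
    return props[train_index] - sum(others) / len(others)

# Hypothetical counts for five test stimuli; index 2 is the training stimulus.
flat_counts = [40, 40, 40, 40, 40]    # pattern expected on the Lashley-Wade view
steep_counts = [5, 30, 130, 30, 5]    # pattern expected on the Hull-Spence view

print(steepness(flat_counts, 2))
print(steepness(steep_counts, 2))
```

An index near zero corresponds to the flat gradient expected if the dimension does not yet "exist" for the subject, whereas a clearly positive index corresponds to the steep gradient expected by Hull and Spence.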

There is some evidence that supports the Lashley-Wade position. Peterson 
(1962) raised ducks in yellow light (589 nanometers). They never had the 
opportunity to distinguish one wavelength from another. A control group was raised in 
normal light. Both groups then were trained to peck a key illuminated by the 
589-nm light. Stimulus generalization was then tested by illuminating the key 
with eight different wavelengths, ranging from 490 nm to 650 nm. The results of 
the experiment are shown in Figure 7.4. The ducks raised in yellow light 
generalized perfectly, or failed to discriminate among the various test stimuli. But the 
controls sharply curtailed their responses as the similarity between the training 
and the test stimuli decreased. This result supports the Lashley-Wade notion that 
experience with the stimulus dimension is important in producing generalization 
decrements. Ganz and Riesen (1962) obtained similar results with infant monkeys 
raised in total darkness. 

Figure 7.4 Generalization gradients obtained from birds raised in monochromatic 
environment (dotted line) and from birds raised in normal light (solid line). (Adapted 
from Peterson, N. Effect of monochromatic rearing on the control of responding by 
wavelength. Science, 1962, 136, 774-775. Fig. 2, p. 774. Copyright 1962 by the 
American Association for the Advancement of Science.) 

Walk and Walters (1973) raised one group of rats in the dark (for up to 60 
days) and a control group in normal light. The visually deprived rats showed 
deficiencies in the ability to discriminate depths. They tended to respond to cliffs 
of varying depths (4, 6, and 8 inches) as though they were not really there. That is, 
they tended to tumble over the cliff more often than rats raised in normal light. 

However, there are also data indicating that steep rather than flat gradients 
may be obtained in the absence of any prior exposure to the many points along 
the stimulus dimension (Jenkins & Harrison, 1960; Thomas, Mariner, & Sherry, 
1969). These investigators have shown that under certain rather specific 
conditions steep gradients may be obtained when the animal has experienced only a 
single value along a stimulus dimension (Rudolph & Honig, 1972; Tracy, 1970). 
For example, Riley and Levin (1971) raised newly hatched white leghorn chicks in 
589-nm light. The chicks were trained to peck a key that was also illuminated 
with 589-nm light. Then in one condition the chicks were tested for their 
tendency to peck keys illuminated by either 589-nm, 569-nm, or 550-nm light. As you 
can see in Figure 7.5, the obtained gradient is not flat as predicted by the 
Lashley-Wade position. 

Figure 7.5 Mean relative generalization gradient for chicks tested with all three 
wavelength values. (Adapted from Riley, D. A., and Levin, T. C. Stimulus-generalization 
gradients in chickens reared in monochromatic light and tested with a single wavelength 
value. Journal of Comparative and Physiological Psychology, 1971, 75, 399-402. 
Copyright 1971 by the American Psychological Association. Reprinted by permission.) 

The conclusion we can draw is that prior experience with many points along 
the dimension may not be absolutely essential for the appearance of a steep rather 
than a flat gradient, but it helps. Our stay-at-home baby might be less likely to 
call strangers “Dada” than he would his real father, but in all likelihood the 
tendency to restrict his response to his father would be even stronger had he had 
prior experience with a variety of strangers. 

Primary and Secondary Generalization 
Hull (1943) describes primary generalization as that involving some innate or 
predetermined stimulus dimension. Secondary generalization refers to 
generalization in which the dimension of similarity is acquired, or learned. For instance, the 
words house and home are similar, but that similarity is not innate. It is learned. 
Generalization across these two stimuli would be considered secondary 
generalization. 

SEMANTIC CONDITIONING. The semantic conditioning effect (see Chapter 2) can 
be used to illustrate the concept of secondary generalization. In a semantic 
conditioning study a response is conditioned to one verbal unit, such as a word. 
The subject is then presented with additional words, which vary in their similarity 
to the original training word. The greater the similarity, the greater is the 
tendency for the test word to elicit the response (Maltzman, 1977). For example, Lacey 
and his associates (Lacey & Smith, 1954; Lacey, Smith, & Green, 1955) ran a 
series of studies in which subjects free-associated to long lists of words. The lists 
contained, among others, several words pertaining to rural life (cow, corn, plow, 
tractor). One of these words was selected by the experimenter as the training 
word. After each presentation of this critical training word the subjects were 
shocked and their heart rate monitored. The results indicated that the mere 
presentation of a word that had been previously followed by shock would yield 
an increase in heart rate. In addition, and more to our point, the heart-rate 
response was elicited by the other related rural words. If the presentation of plow 
was followed by a shock, then the presentation of corn would produce an 
increase in heart rate even though corn had never been followed by shock. This 
result demonstrates generalization along a dimension of learned meaning 
similarity. Parenthetically, it also demonstrates the complexity of generalization effects 
in human activity. 

ACQUIRED EQUIVALENCE OF CUES. Another pertinent area of research involves 
the acquired equivalence of cues. In a typical experiment (Jeffrey, 1953) children 
were taught to move a lever in one direction when a white stimulus was presented 
and in another direction when a black stimulus was presented. Half the subjects 
were then taught to call a gray stimulus white, whereas the other half learned to 
label the same gray stimulus black. All subjects then were presented with a 
random sequence of white, black, and gray stimuli and were required to move the 
lever in response to each stimulus. When the gray stimulus was presented to 
children who had learned to call it white, they moved the lever in the direction 
appropriate for white. Similarly, children who had learned to call the gray 
stimulus black moved the lever in the direction appropriate for black. Through 
learning, gray and white (or gray and black) had been “made similar.” Through 
stimulus generalization the lever responses originally connected to the white or 
black stimuli were elicited by the gray stimulus. 

Semantic generalization, or semantic conditioning, and acquired equivalence of 
cues are quite closely related. One difference between the two is that in 
semantic-generalization studies the learned similarity between the stimuli is usually 
established before the subject enters the test situation. In the acquired-equivalence 
paradigm the learned equivalence is actually established within the experimental 
situation (see also Ellis, 1973; Malloy & Ellis, 1970; Saltz, 1971). 

Generalization of Extinction 

Thus far we have been speaking as though only positive tendencies to make 
responses generalize. This is not so. Tendencies not to make particular responses 
can generalize, too. If a child has learned not to talk in one particular classroom, 
then that tendency not to talk can generalize to other similar situations. 

In a three-step laboratory demonstration an organism is first trained to respond 
equally strongly to a number of different stimuli selected from a dimension of 
stimuli. For example, a pigeon might be reinforced for pecking 10 different circle 
sizes, so that it would peck each and every one of them with a 
very high probability. Then in the second step one of the circle sizes is randomly 
selected for extinction. The pigeon is exposed to this one stimulus repeatedly and 
not reinforced for responding to it. Obviously, the animal stops responding to this 
particular stimulus; it extinguishes. Then in the third phase of the demonstration 
the pigeon is tested on the remaining nine stimuli, which it has not seen since the 
first step in the experiment, when it learned to peck all of them. What is found in 
this final test phase is that the tendency not to peck the stimulus value selected for 
extinction generalizes, or spreads to these other stimuli. The pigeon will show a 
reduced probability of responding to circles whose sizes are similar to the one 
circle selected for extinction. 
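
The outcome of the third phase can be sketched numerically. The Python below assumes, for illustration only, that the reduction in responding is greatest at the extinguished circle size and shrinks as a Gaussian function of distance from it. The function name, the baseline probability of 0.95, the maximum drop of 0.90, and the spread are all hypothetical values, not data from any study cited here.

```python
import math

def response_probabilities(sizes, extinguished, baseline=0.95,
                           max_drop=0.90, spread=2.0):
    """Hypothetical probability of pecking each circle after extinction.

    The drop in responding is largest at the extinguished size and shrinks
    with distance from it (Gaussian falloff assumed for illustration).
    """
    probs = []
    for s in sizes:
        drop = max_drop * math.exp(-((s - extinguished) ** 2) / (2 * spread ** 2))
        probs.append(baseline - drop)
    return probs

sizes = list(range(1, 11))          # ten circle sizes, arbitrary units
probs = response_probabilities(sizes, extinguished=5)
for s, p in zip(sizes, probs):
    print(s, round(p, 2))
```

In this sketch the circles closest in size to the extinguished one show the lowest probability of responding, while circles far from it remain near the baseline level established in the first step.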

Behavior Modification and Generalization 
Generalization is an important issue in applied psychological work. For example, 
a critical issue in the field of behavior modification is whether behavior trained in 
the laboratory will generalize to everyday life. If we put an alcoholic through 
avoidance conditioning, will he, as soon as we release him from the experimental 
situation, head for the nearest bar? Will a mute child, trained to speak in a tightly 
controlled situation, fall mute again once she is removed from that situation? 

Some studies report fairly good generalization, whereas others do not (see 
Barton & Ascione, 1979; Coleman, Whitman, & Johnson, 1979; Geller, 1983; 
Halle, Marshall, & Spradlin, 1979). For example, Tracey, Briddell, and Wilson 
(1974) awarded tokens each time psychiatric patients said positive things about 
other people or about hospital activities such as dances and the like. In the 
training condition positive statements about both people and activities increased 
substantially. But the important question was whether these increases would 
carry over into other activities. Some of them did, and some did not. On the one 
hand, an increase in positive statements about hospital activities actually led to 
increased attendance at activities. In other words, if patients were rewarded for 
saying positive things about the hospital dances, they tended to go to the actual 
dances more often. That is a pretty strong indication of the generalization of a 
learned response system. On the other hand, rewarding patients for making 
positive statements about people in the training situation did not appear to lead to 
more positive statements about people outside the training situation. This 
equivocal pattern of results is indicative of the problems facing investigators in 
this area. 

Stevens-Long and Rasmussen (1974) report that autistic children trained in the 
use of simple and compound sentences sometimes produce novel sentences on 
their own. In other words, language training may generalize in some complex 
manner, allowing the children to use the acquired rules and information in the 
construction of new sentences. 

And finally, Kleitsch, Whitman, and Santos (1983) report that increased verbal 
interaction among elderly retarded patients showed very good generalization. The 
patients talked more and in many different situations. In addition, nonsubject 
patients began to talk more, too! 

In summary, many behavior-modification studies show good generalization 
beyond the experimental setting, but in other studies generalization fails to 
materialize. The importance of generalization and its elusiveness suggest that it 
should be made one of the critical concerns in any behavior-modification study. If 
a behavior change is limited to the experimental setting, then in a practical sense it 
is of little interest or value. 
DISCRIMINATION AND STIMULUS CONTROL

Generalization and discrimination may well be opposite sides of the same coin. 
They may bear an inverse relationship to each other. A decrease in stimulus 
generalization refers to a decrease in the tendency to give the same response to 
similar stimuli. An increase in discrimination refers to the same thing—the 
tendency to restrict responses to one stimulus value chosen from a set of similar 
stimuli. Discrimination refers to a breakdown in the tendency to generalize. It is 
difficult, if not impossible, to untangle the two processes. Although their exact 
relationship has never been defined to everyone’s satisfaction, some experimental 
work supports the idea that they are closely related (Ganz, 1962; Thomas & 
Mitchell, 1962). 

Many modern researchers like to think of both generalization and discrimination
in terms of stimulus control, or the tendency for stimuli to elicit responses.
Generally speaking, generalization refers to relatively imprecise stimulus control, 
whereas discrimination refers to those processes reflecting the tightening, or 
sharpening, of stimulus control. 

When presented with two stimuli—say, two lights of different color—and 
rewarded for responding to one but not the other, the animal will most often 
behave in the manner depicted in Figure 7.6. Notice that initially the animal 
makes few responses at all. But then, as it discovers that there is something good 
about the situation (the animal is receiving reinforcement), it begins to respond to 
both stimuli about equally frequently. It is only with a little additional experience 
that the animal begins to realize that one stimulus is associated with reward and 
the other is not. After the initial tendency to respond to either stimulus the animal 
begins to drop the response to the negative stimulus and increase its responses to 
the positive stimulus. 

On the one hand, the environment sometimes demands, or reinforces, generalization.
For example, to function successfully and maximize reinforcement one
must be able to generalize along such dimensions as red lights. On the other hand, 
the world sometimes rewards discrimination rather than generalization. For 
example, the successful wine taster is not one who says, “Hey, all this stuff tastes 
the same.” He is the one who can make finer and finer discriminations among very
similar stimuli. (Amateur wine tasters often unknowingly discriminate among 
labels and colors rather than actual taste differences. Blindfold your local wine 
expert, and you may well find that he or she has trouble distinguishing one wine 
from another.) The successful animal is one that can, depending on the demands 
of the environment, both generalize and discriminate. We must be able to distinguish
between those occasions that require generalization and those that demand
discrimination. Too much of either process will lead to less than maximum 
reinforcement. 

The mechanism whereby generalization is reduced, or discrimination is increased,
involves the reinforcement of responses made to one particular stimulus
and the nonreinforcement of responses made to similar stimuli. This is what we 
mean by differential reinforcement. For example, the baby comes to distinguish 
her father from other men through a process of differential reinforcement. Her 
initial tendency to generalize, or to say “Dada” to all men, is reduced by 
reinforcing “Dada” in response to her father and not reinforcing it when it occurs 
in response to other men. 
Figure 7.6 The course of discrimination learning. After an initial period, during which
the probability of a response to both the positive and negative stimulus increases, the 
response to the negative stimulus drops out while the response to the positive stimulus 
continues to strengthen. (These are hypothetical data.) 

Simultaneous Versus Successive Methods 
Two of the more common methods used in the investigation of discrimination 
learning have been labeled the simultaneous and the successive presentation 
techniques (Crowell & Bernhardt, 1979; Pullen & Turney, 1977). In the simultaneous
method two stimuli are presented together. A pigeon might be presented
with two different illuminated keys. A response to one (the positive stimulus) 
would be reinforced. A response to the other (the negative stimulus) would not be 
reinforced. With repeated differential reinforcement the pigeon will come to 
restrict its pecking response to the positive stimulus. 

The successive method involves the sequential presentation of the positive and 
negative stimuli. The two stimuli are presented one at a time in some random 
order. The animal is free to respond or not respond to each stimulus as it appears. 
This would be comparable to a situation in which the baby saw either her father 
or her uncle but never both at the same time. 

The Algebraic Summation Interpretation 
Hull (1943) and Spence (1936, 1937) developed what has come to be known as
the absolute, or the algebraic summation, interpretation of discrimination learning.
Imagine a dimension of circle size (see Figure 7.7). Choose from this dimension
two particular circle sizes. Call one the positive stimulus and the other the
negative stimulus. Present these two circles, and only these two, to a pigeon.
Reinforce the pigeon when it pecks the positive stimulus. Do nothing when it
pecks the negative stimulus. According to Hull and Spence, each time the pigeon
responds to the positive stimulus and is reinforced for doing so, there results an
increase in the tendency to respond to that stimulus (see the upward arrow in
Figure 7.7). Each time the pigeon pecks the negative stimulus and is not reinforced
for doing so, there is an increase in the tendency not to respond to that
particular stimulus value (see the downward arrow). The positive tendency to
respond to the reinforced stimulus generalizes (see the positive gradient in Figure
7.7) along the stimulus dimension. Similarly, the negative tendency also generalizes
(see the negative gradient). These two gradients, the positive and the negative,
sum algebraically. The resulting net gradient in Figure 7.7 is the result of the
algebraic interaction of the positive and negative gradients. The probability of a
response’s occurring at any point along the dimension is determined by the
strength of the competing positive and negative tendencies at that point. For
example, consider stimulus X in Figure 7.7. At this point the generalized strength
of the negative tendency is considerable, but it is not as great as the generalized
positive tendency. Hence, if we present circle X to our pigeon, it will probably
peck it. The positive tendency outweighs the value of the negative tendency at that
point. The net gradient summarizes the resolution of the opposing positive and
negative tendencies pulling on the animal.

Figure 7.7 An example of the Hullian interpretation of discrimination learning.
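
The algebraic summation idea can be sketched numerically. The following Python fragment is only an illustration, not Hull and Spence's own formulation: it assumes bell-shaped (Gaussian) gradients, and every number in it (circle sizes, gradient heights, gradient width) is hypothetical, chosen simply to mimic the situation of stimulus X described above.

```python
import math

def gradient(size, center, height, width):
    """Generalized tendency at `size` for a gradient peaking at `center`.
    The Gaussian shape is an assumption made for illustration."""
    return height * math.exp(-((size - center) ** 2) / (2 * width ** 2))

def net_tendency(size, s_plus=10.0, s_minus=6.0,
                 pos_height=10.0, neg_height=6.0, width=3.0):
    """Algebraic sum: generalized positive tendency minus negative tendency."""
    positive = gradient(size, s_plus, pos_height, width)
    negative = gradient(size, s_minus, neg_height, width)
    return positive - negative

x = 8.0  # hypothetical circle size lying between the two training stimuli
print(net_tendency(x) > 0)  # is the net tendency at X positive?
```

Under these assumed values the negative tendency at X is sizable but smaller than the positive one, so the net tendency is positive and a response is expected.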

It is an interesting model, and it leads to many testable hypotheses. For 
example, what will happen to the probability of the response to the positive 
stimulus as we increase the number of unreinforced responses that the animal 
makes to the negative stimulus? What will happen to a baby’s tendency to say 
“Dada” to her father as we increase the number of times she says “Dada” to her 
uncle and is not reinforced for doing so? On an intuitive basis we might suspect 
that such training might sharpen the baby’s perception of the differences between 
her father and her uncle and lead to a stronger tendency to respond to the father. 
But notice carefully what the Hullian model suggests. If we increase the number 
of unreinforced responses to the negative stimulus, the negative gradient increases.
This increase detracts from the tendency to respond to the positive
stimulus. Through repeated nonreinforced responding to the uncle we would 
expect the baby’s tendency to say “Dada” to her father to decrease rather than 
increase. This curious and somewhat unexpected result has received support in
the laboratory (see Gynther, 1957; Purtle, 1973). 
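
The logic of this prediction can be traced in a short sketch. It is an illustration under stated assumptions rather than the Hullian equations themselves: the Gaussian spread, the growth rule for the negative tendency, and all parameter values are hypothetical.

```python
import math

def generalized(strength, distance, width=3.0):
    """Strength remaining at `distance` from the trained stimulus
    (the Gaussian spread is an illustrative assumption)."""
    return strength * math.exp(-(distance ** 2) / (2 * width ** 2))

def negative_strength(unreinforced_responses, ceiling=8.0, rate=0.1):
    """Assumed growth of the negative tendency with nonreinforced responses."""
    return ceiling * (1 - math.exp(-rate * unreinforced_responses))

def net_at_positive(unreinforced_responses, positive_strength=10.0, separation=4.0):
    """Net tendency to respond to the positive stimulus after summation."""
    spill = generalized(negative_strength(unreinforced_responses), separation)
    return positive_strength - spill

for n in (0, 10, 50):
    print(n, round(net_at_positive(n), 2))
```

In this sketch the net tendency at the positive stimulus can only fall as unreinforced responses to the negative stimulus accumulate, which is the counterintuitive direction the model suggests.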

Relational Theory: A Cognitive Approach 
The Hullian theory just described has been challenged by what is called the 
relational theory of discrimination learning (Kohler, 1955). According to this 
position, the animal in forming a discrimination learns a relation between the 
positive and negative stimuli. When discriminating between two circle sizes the 
pigeon may well be attending to the fact that one stimulus is larger than the other. 
In a sense it may be learning that “bigness is goodness.” The emphasis here is on 
the cognitive, somewhat fuzzy, concept of relationship learning rather than the 
summation of independently established positive and negative gradients. The 
relational position emphasizes a comparison process and suggests that the animal’s
response to the stimuli is based on a perceived relation between them.
Although somewhat ambiguous, the relational theory may well yield predictions 
that differ from those generated by the Hullian scheme. 

The first differential prediction has to do with a comparison of the simultaneous
and the successive methods just outlined. If the relational position is
correct, we might expect that the simultaneous method would produce better
discrimination learning than the successive method, simply because the relationship
between the two stimuli might be more easily perceived. If they were
presented together, the perception of the relation would not be dependent on a
“memory” of one of the stimuli, as would be the case in the successive presentation
method. The experimental data, however, contain all possible results with
respect to this question. Neither method seems to be superior under all conditions 
(see Blough & Lipsitt, 1971; Kimble, 1961). Thus, although the theory seems to 
generate a distinctive hypothesis, nature and our experiments have not been 
particularly cooperative in yielding a definitive answer to the question. 

Transposition 

The transposition effect is another phenomenon that bears upon the
relational-versus-absolute issue. Suppose that an animal has been trained to discriminate
between a dark stimulus (the positive stimulus) and a light stimulus (the negative 
stimulus). Now suppose that we remove the light stimulus from the situation and 
pair the dark positive stimulus with a new, heretofore unseen, darker stimulus. 
The animal’s choice is now between the old positive stimulus, which has been 
consistently reinforced, and the new darker stimulus, which has never even been 
seen before. Common sense would argue that the animal would continue to 
respond to the old positive stimulus. But the relational position would argue that, 
if the animal has, in fact, learned the relation between the light (negative) and the 
dark (positive) stimuli and responds on the basis of that relation, then it should 
respond to the new darker stimulus rather than the old positive stimulus when 
given a choice between these two. It should have learned that “darkness is 
goodness.” This is exactly what happens. The animal responds to the new, darker 
stimulus rather than the old, positive one. This transposition effect has been 
observed with all sorts of animals under many different conditions (see Reese,
1968; Riley, 1968).
Figure 7.8 Spence’s analysis of transposition. During training, responses to stimulus 256
are reinforced and responses to stimulus 160 are extinguished. (Adapted from Spence,
K. W. The differential response in animals to stimuli varying within a single dimension.
Psychological Review, 1937, 44, 430-444. Fig. 1. Copyright 1937 by the American
Psychological Association.)

The transposition effect seems to support the relational position. It suggests 
that the animal is responding on the basis of a perceived relationship rather than 
discrete stimulus-response connections. And yet Spence (1937), in a classic analysis,
argues that simple S-R relationships can, after all, account for transposition
effects.

According to his model (see Figure 7.8), a positive excitatory gradient is built 
up around the positive stimulus (value 256). A negative gradient (dotted line) is 
built up in connection with the negative stimulus (value 160). The probability of 
responding to any point along the stimulus dimension depends on the difference 
between the positive and negative gradients. If you will spend a moment with this 
model, you will realize that it predicts transposition effects very nicely. For 
example, suppose that the old positive stimulus (256) was paired with a new test 
stimulus (say, of value 655). According to the model, the probability of responding
to stimulus 655 would be greater than the probability of responding to the old
positive stimulus (256). This would be a transposition effect, and it is predicted 
on the basis of discrete S-R connections without any reference to the learning of 
relationships. 
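
A rough numerical sketch may help in following the argument. The code below is not Spence's actual set of curves: it assumes Gaussian gradients on a logarithmic stimulus scale, and the gradient heights and widths are hypothetical. Only the training values (256 and 160) and the test value (655) come from the example above.

```python
import math

S_PLUS, S_MINUS = 256.0, 160.0  # training values used in the example above

def gradient(stimulus, center, height, width):
    """Gaussian gradient on a log2 stimulus scale (an assumed shape)."""
    d = math.log2(stimulus) - math.log2(center)
    return height * math.exp(-(d ** 2) / (2 * width ** 2))

def net_strength(stimulus, width, excitatory=10.0, inhibitory=8.0):
    """Excitatory gradient around S+ minus inhibitory gradient around S-."""
    return (gradient(stimulus, S_PLUS, excitatory, width)
            - gradient(stimulus, S_MINUS, inhibitory, width))

def predicts_transposition(test_stimulus, width):
    """True if the new test stimulus ends up stronger than the old positive one."""
    return net_strength(test_stimulus, width) > net_strength(S_PLUS, width)

print(predicts_transposition(655.0, width=2.0))  # broad gradients
print(predicts_transposition(655.0, width=0.5))  # narrow gradients
```

With the broad gradients assumed here the sketch yields transposition; with the narrow ones it does not, so the outcome depends on the shape chosen for the hypothetical gradients.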

Spence’s analysis still stands as one of the more ingenious interpretations of 
transposition. In effect, it broke the strong grip that the relational theorists had 
had on the transposition effect since the beginning of the century. This is not to 
say that Spence’s model is perfect, either. For example, it is dependent on how 
one draws the hypothetical gradients. If one wishes, one can redraw the gradients 
so that transposition is not predicted. In addition, there are certain facts that his 
model does not seem to explain (see Honig, 1965). There have also been some 
fairly recent attempts to revive the relational interpretation (Zeiler, 1967). 

Noncontinuity Theory: Further Cognitive Factors 
In addition to the relational theory there has been a second challenge to the
Hull-Spence interpretation of discrimination formation. This approach is usually
labeled noncontinuity theory (Krechevsky, 1932). According to this position, 
which has never been as precisely stated as the Hullian theory, the organism
solves a discrimination problem by trying out and abandoning successive hypotheses
until the correct one is discovered, perhaps all at once. For example, a rat
faced with two doors, one of which leads to food, might first hypothesize that the 
left door was the correct one. If this failed, the rat would drop that hypothesis and 
try another, such as “the black door is correct,” and so on, until the solution to 
the problem was found. 

In other words, Hull and Spence emphasized the gradual, continuous accumulation
of habit strengths, whereas Krechevsky emphasized the idea that a
discrimination problem may be solved all at once through a process of hypothesis 
testing. 

Tests of these opposing views have usually used a stimulus reversal procedure. 
Specifically, a rat will be given several trials on a black-white discrimination
problem where black is correct. During this presolution period the animal is given 
some experience with the problem but not enough to actually solve it. Then the 
experimenter reverses the cues, or makes the white stimulus the positive stimulus 
and the black the negative stimulus. 

What would the two theories predict here? Continuity theory (Hullian) predicts 
that the reversed problem would be difficult, because during the presolution 
period the animal was slowly and continuously learning that black was correct 
but now black is incorrect. Noncontinuity theory, in contrast, predicts that the 
presolution experience should have no impact on learning of the reversed problem.
The rat will have learned nothing that will affect its ability to learn the
reversal. 
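
The two predictions can be contrasted in a toy model. The sketch below is a deliberate simplification, not a model taken from the literature: the step size, the criterion, and the fixed number of trials needed to hit on the correct hypothesis are all hypothetical values.

```python
import math

def continuity_trials(presolution_trials, step=1.0, criterion=10.0):
    """Trials to reach criterion on the reversed problem if habit strength
    for the old cue accumulated gradually during the presolution period."""
    # Strength favoring black after presolution training (kept below criterion,
    # because the animal has not yet solved the problem).
    carried_over = min(presolution_trials * step, criterion - step)
    # The reversal must first undo that strength, then build the new habit.
    return math.ceil((carried_over + criterion) / step)

def noncontinuity_trials(presolution_trials, trials_to_find_hypothesis=10):
    """Trials to solve the reversed problem if nothing is learned until the
    correct hypothesis is adopted; presolution experience is ignored."""
    return trials_to_find_hypothesis

for n in (0, 4, 8):
    print(n, continuity_trials(n), noncontinuity_trials(n))
```

In the continuity version the cost of the reversal grows with the amount of presolution experience, whereas in the noncontinuity version it stays flat.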

Most of the experiments using the reversal procedure seem to support the 
continuity position; reversing cues does seem to retard learning (Mackintosh, 
1974; Sutherland & Mackintosh, 1971). 

Continuity theory is not “home free,” however, because of what is called the 
overlearning reversal effect. Let’s suppose that in the experiment described above 
we give the rat many, many trials before reversing the cues, instead of just a few 
trials during a presolution period. In effect, we allow the animal to overlearn the 
fact that black is positive before reversing the situation. Hullian continuity theory 
predicts that the animal will have a great deal of trouble learning the reversal. But 
in fact, the data indicate that overlearning speeds up the learning of the reversal 
(Mackintosh, 1974). 

The upshot of all of this is the current suspicion that discrimination learning 
may involve, under certain conditions, both continuous and noncontinuous subprocesses.
It would be an oversimplification to say that either gradually increasing
habit strengths or hypothesis testing can completely account for discrimination 
learning. 


Behavioral Contrast 

Having spent some time considering major theories of discrimination we now 
turn to a discussion of some additional issues relating to the discrimination 
process. The behavioral contrast effect is an interesting one. It was investigated by 
Reynolds (1961a, 1961b) and others some time ago and is still undergoing rather 
extensive investigation (for example, Keller, 1974; Redford & Perkins, 1974; 
Williams, 1977). The work done by Reynolds provides a good example of the 
effect. Pigeons were trained to peck a key. Sometimes the key was red, and 
sometimes it was green. At first the color made no difference, for the pigeon was
reinforced regardless of the color of the key. In this phase of the experiment the 
pigeon pecked the two colors equally frequently. In the second phase of the study 
the red stimulus became the positive, or reinforced, stimulus, and the green cue 
became the negative, or unreinforced, stimulus. As one might expect, the response 
rate to the green key dropped. Responses to this key went unreinforced, and 
hence they extinguished. But what about the rate of response to the red key? 
Remember that a response to this key was treated identically in the two phases— 
it was always reinforced. The shift from the first to the second phase did not 
involve a change with respect to the red key. In spite of this constant treatment the 
rate of response to the red stimulus went up during the second phase. This is the 
contrast effect (see also Lombardi & Flaherty, 1978; Rovee-Collier & Capatides,
1979; Woodruff, 1979). 

How might one explain this effect? Why did the pigeon increase its response to 
the red key even though the size and frequency of the reinforcements stayed the 
same? Let us first look at what the Hull-Spence model would state about the
issue. It seems that their model would predict a decrease in the rate of response to
the red key rather than an increase. With the shift to phase two a negative, or
inhibitory, tendency would be assumed to build up in association with the
unreinforced green key. This negative tendency should generalize and detract
from the positive tendency to respond to the red key. Thus, this aspect of the
Hull-Spence model does not seem to account for the behavioral contrast effect.

What other ways are there to think about the contrast effect? Williams (1973) 
and others suggest that frustration may be involved. They argue that the animal 
experiences frustration when it pecks the green key and receives no reinforcement 
in phase two. (We really have no idea of the nature of the pigeon’s subjective 
experience. We call it frustration for convenience and assume that it is similar to 
what we experience when we put a quarter in the coffee machine and coffee, but 
no cup, is returned.) Call it what you like, it is assumed to act as a drive, or 
motivating, agent. Frustration activates and motivates the pigeon so that it 
increases its behavior in the presence of the red stimulus. 

It is clear that the behavioral contrast effect does occur. Why it occurs is 
another question. The following discussion lends some support to the frustration 
hypothesis. 


Errorless Discrimination Learning 

Terrace (1961, 1963a, 1963b, 1972) devised a method for discrimination learning
in which the animal never actually responds to the negative stimulus. In simple 
terms, Terrace first establishes a strong response to the positive stimulus alone, 
and then he very, very gradually introduces the negative stimulus into the situation.
He brings the negative stimulus in so slowly, in such small steps, that the
pigeon never responds to it. For example, pigeons can be trained to discriminate a 
red key from a green key (negative stimulus) without ever making an unreinforced 
response to the green key. First, the pigeon is trained to peck the red key alone. 
Then during the presentation of the illuminated red key the light in the key is 
momentarily turned off. During these brief dark intervals the pigeon typically 
draws away from the key. The dark intervals may gradually be lengthened
without the pigeon’s responding. Then a very dim green light is presented instead 
of the totally dark key. The pigeon still does not respond to it, even though it is 
pecking resourcefully at the key when it is red. Very gradually, the intensity and 
the length of the green light can be increased. Eventually, the key may be green 
half the time and red half the time. Birds trained in this manner will produce few, 
if any, responses to the green stimulus. The complete discrimination is formed 
without errors (see also Doran & Holland, 1979; Fields, 1979). (It should be kept
in mind that, although the effect has been obtained many times, it is difficult to 
establish. Terrace, in fact, describes some situations in which it seems impossible 
to establish errorless discrimination learning.) 

The frustration interpretation of the behavioral contrast effect and errorless 
discrimination learning are interconnected. If an animal forms a discrimination 
without ever experiencing the “disappointment” that follows an unreinforced 
response, it will not experience frustration. If an animal is trained under the 
errorless method, it never makes unreinforced responses to the negative stimulus. 
If it never experiences nonreinforcement, it is never frustrated. Thus, if we look 
for the behavioral contrast effect in an errorless-trained pigeon, we should not 
expect to find it. If the errorless-trained pigeon does not experience frustration, 
there will be no frustration to motivate it, or to goad it into responding more 
frequently to the positive stimulus. Terrace (1964) obtained just this result. He 
compared contrast effects obtained with errorless-trained birds and normally 
trained birds. A greater contrast effect appeared under normal (frustrating) 
conditions. This result lends some support to the frustration interpretation of the 
behavioral contrast effect. However, there have been some more recent failures to 
find this effect (Rilling & Caplan, 1975).

Insoluble Discrimination Problems 

Thus far we have been speaking of discrimination problems that the animal can 
solve. Through experience it can maximize reinforcement by responding to one 
particular stimulus and not responding to another. What happens to its behavior 
if the problem is too difficult, or even impossible, for the animal to solve? For 
instance, there is a point beyond which we cannot detect differences between the 
brightness of two lights. What happens to our behavior if we are forced to 
respond in such an impossible situation, receiving reward for correct responses 
and perhaps punishment for incorrect ones? 

There are two experimental techniques for observing behavior in insoluble 
discrimination situations. In the first method the animal can perceive the differences
between the two stimuli, but reward is randomly associated with them
(Maier, Glazer, & Klee, 1940; Maier & Klee, 1945). The animal can never be sure
which stimulus to respond to. For example, Maier and his associates used a 
Lashley jumping stand, which, if you will recall, requires the animal to leap across 
an open space and through one of two little doors. Typically, one door is white 
and the other black. In a normal, soluble discrimination situation one color will 
be the correct choice and the other will be incorrect. If the animal jumps toward 
the correct door, the door opens, and the rat leaps through to safety. If the rat 
jumps at the incorrect door, the door remains closed; the animal bumps into it 
and drops down into a net. (As you know, the typical procedure is to switch the
color cues from left to right to ensure that the animal learns a color rather than a
position discrimination.) In this soluble situation the animal easily learns the 
discrimination. 

Things are not so simple in the insoluble situation. In this condition the 
experimenter may reinforce the animal (allow safe passage through the door) half 
the time, regardless of whether the animal jumps to the right or to the left. There 
is no way that the animal can solve the problem. The animal tries, but it is 
impossible to obtain reinforcement all the time. Maier found that rats will 
develop some very peculiar behaviors in this situation. Typically they begin to 
respond to one side or the other. One rat might jump to the left no matter what 
the color. Another might fixate on the right-hand door. (You might ask why the
rat jumps at all. Simple. The experimenter applies shock until the rat leaps.) 
Maier describes this fixated jumping as “perseverative responding.” We might 
ask: “So what? As long as there is no way to solve the problem, why not jump to 
one side? Jumping to one side results in as much reinforcement as switching from 
side to side.” The interesting aspect of the situation develops when we suddenly 
make the problem soluble. That is, we may begin reinforcing every response to 
the black and punishing every response to the white. A naive rat would learn this 
discrimination quickly. But a fixated rat does not readjust its behavior even 
though the cues necessary for such a readjustment are there. This fixated behavior 
will persist through hundreds of trials. 
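
The point that a fixated rat loses nothing while the problem is insoluble can be checked with a small simulation. This is a sketch under assumptions, not a reconstruction of Maier's procedure: it supposes that on each trial one door, chosen at random, is unlocked, and the trial count, seed, and strategy functions are hypothetical.

```python
import random

def reinforcement_rate(strategy, trials=10_000, seed=0):
    """Proportion of reinforced jumps when the unlocked door is chosen
    at random on every trial (assumed 50/50 schedule)."""
    rng = random.Random(seed)
    reinforced = 0
    for trial in range(trials):
        unlocked = rng.choice(["left", "right"])  # unrelated to any cue
        if strategy(trial) == unlocked:
            reinforced += 1
    return reinforced / trials

def always_left(trial):
    """Fixated responding: jump left no matter what."""
    return "left"

def alternate(trial):
    """Switch sides from trial to trial."""
    return "left" if trial % 2 == 0 else "right"

print(reinforcement_rate(always_left), reinforcement_rate(alternate))
```

Both strategies should hover near one half, which is why position fixation costs the animal nothing until the problem is made soluble.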

The second paradigm that involves insoluble discrimination problems has been 
labeled the “experimental neurosis” situation. In the Maier situation just described,
reinforcement is randomly associated with distinct stimuli. In the experimental
neurosis paradigm, reinforcement is consistently associated with one
of two stimuli, but the stimuli become so similar the animal cannot tell them 
apart. For example, Brown (1942) trained rats to distinguish between lights of 
two degrees of brightness. An approach to the brighter one, or a retreat from the
dimmer one, was reinforced with food. Incorrect responses were punished with 
shock. After the discrimination had been established, Brown began to make the 
two stimuli more and more similar. It became increasingly difficult for the 
animals to distinguish between the two stimuli. The animals became excited; they 
trembled, defecated, urinated, and leaped about. Some animals even displayed 
convulsive behavior. Pavlov (1927) demonstrated similar behaviors using dogs 
and salivary responses. The dogs were forced to discriminate between a circle and 
an ellipse. The ellipse was then made progressively more like the circle. As the two 
stimuli became very similar, the dogs began to salivate at the sight of the 
apparatus. They whined, barked, and tried to leap free of the restraining apparatus.
These so-called “neurotic” behaviors persisted beyond the confines of the
experimental situation. 

The parallel between these experimental effects and human experiences is 
pretty straightforward. For example, if overly ambitious parents push a child in 
his school work, they may exceed his abilities. Behavior problems may result, and 
the child may become disruptive and unhappy. If he is pressed to respond 
differentially to stimuli that are beyond his ability to discriminate or that seem to 
him to be the same thing, then he may break down. For example, if parents 
consistently demand that a normal 7-year-old distinguish between such words as 
laziness and idleness, they may be exceeding his capacities, and they may force
him into a state of agitation and discomfort. 


DISCRIMINATION IN HUMANS 
Human Processes: Verbal and Nonverbal 
We have discussed many discrimination experiments, and most of them have 
involved animals as subjects. But because our primary concern is with humans, 
rather than rats or pigeons, we should spend a little time considering the role of 
discrimination processes in human verbal learning tasks. That is not to suggest 
that all research concerning human discrimination learning has to do with verbal 
tasks. Quite to the contrary, the investigation of the human’s ability to discriminate
along nonverbal dimensions is actively continuing (see Hansen, Tomie,
Thomas, & Thomas, 1974; Hebert, Bullock, Levitt, Woodward, & McGuirk, 
1974; Thomas, Svinicki, & Vogt, 1973). These and other investigators have been 
concerned with the human’s ability to discriminate points along such dimensions 
as weight, brightness, size, and area. In addition, a sizable literature has been built 
up around the question of our ability to discriminate among pictures. For example,
recent efforts have revealed that subjects discriminate among pictures more
easily than among verbal labels of those pictures (Levin, Ghatala, & Wilder, 1974;
Rowe, 1972; Wilder & Levin, 1973). In a later chapter we will devote considerable
attention to the overall issue of visual versus verbal memory and humans’
abilities to process and code visually presented materials. But for now we shall 
concentrate on verbal discrimination. 

The importance of discrimination in human verbal learning is emphasized by 
the stage analysis of associative learning (Underwood & Schulz, 1960). According
to this analysis, simple verbal learning involves at least three subprocesses, each of 
which is essentially a discrimination process. As an example, consider subjects 
trying to master a paired-associate list. First, they must discriminate the stimuli, 
one from the other. This is easy if the stimuli are something like common words. 
But if the stimuli are QZKX, QKXZ, XQZK, KZXQ, and so on, the subjects 
have a problem. In order to hook responses to these stimuli the subjects must sort 
them out and distinguish one from the other. This is called stimulus learning. 

Second, the subjects must make similar discriminations among the responses. If 
the responses are QZK, KZQ, and ZKQ, they must get them clear in their mind 
before paired-associate learning can be completed. This is called response learning
and obviously involves making discriminations.

Finally, in associative learning the subjects must make the discriminations 
necessary to join together the correct stimuli and responses. 

Clearly, the stage analysis illustrates the fact that even relatively simple verbal 
tasks involve many complex and interrelated discriminations. 


Verbal-Discrimination Learning 

The verbal discrimination (VD) paradigm (Ekstrand, Wallace, & Underwood,
1966; Kausler & Yadrick, 1977; Underwood, Jesse, & Ekstrand, 1964) represents
a direct extension of the discrimination paradigm into the field of verbal learning.

LABORATORY EXPERIMENTS. In a typical VD experiment the subjects are presented
with a long series of pairs of verbal items (for example, words). One
member of each pair has been arbitrarily designated as the “correct” item by the 
experimenter. The subjects’ task is to identify the correct items. They do not have 
to recall the items, nor do they have to associate them with any other item. Their 
sole task is to identify which of the two items within each pair is the “correct” 
item. They guess at first, and if they are correct, an experimenter so informs them; 
they are also made aware of their errors. At first, not knowing which of the two 
items is correct, the subjects will respond to them somewhat randomly. As trials
progress and the subjects are “reinforced” through response confirmation, they 
come to restrict their choice to the correct items. It seems quite similar to the 
pigeon pecking colored lights for food. 

One difference between the VD procedure and typical animal discrimination 
techniques is that the humans are faced with the task of forming multiple, and 
almost simultaneous, discriminations. The pigeons are faced with only two stimuli.
The humans are presented with a long list of pairs, each of which contains a
correct and an incorrect stimulus. If presented with a single pair, the humans 
would solve the problem in one trial. In order to study the gradual formation of a 
discrimination in humans, the experimenter must make the task more difficult 
than that used with animals. The human task is rendered more difficult by adding 
additional pairs of stimuli. 

This does not necessarily mean that the human solves discrimination problems 
in a manner fundamentally different from that of the pigeon. It may merely mean 
that the human is far superior at forming the same sorts of discrimination. 

FREQUENCY THEORY. Aside from parametric investigations, much of the work 
done with VD has centered on the “frequency theory” interpretation of VD 
effects (Hasher & Chromiak, 1977; Underwood & Freund, 1970; Underwood, 
Jesse, & Ekstrand, 1964). According to frequency theory, the discrimination 
between right and wrong items is made on the basis of the relative frequencies of 
the members of each pair. Basic to this theory is the idea that items acquire 
frequency units each time the subjects respond to them. The more frequency units 
an item has acquired, the more likely the subjects are to label that item as the 
correct one. Ekstrand, Wallace, and Underwood (1966) have clarified what is 
meant by frequency units. There are a number of different types of response that 
can add frequency units to an item. First, perceiving an item is one type of 
response that is presumed to add frequency units to that item. Because the 
subjects probably perceive both the right and the wrong items on each trial, it is 
presumed that this type of response adds frequency units to both right and wrong 
items. Second, pronouncing an item adds frequency units. Each time the subjects 
say the correct item, an additional frequency unit is added to that item. The third, 
and perhaps most important, type of response that adds frequency units is the 
rehearsal response. These responses presumably occur during the feedback interval
when the correct item is clearly identified for the subjects. The subjects
rehearse the correct item at this point, adding frequency units. They therefore 
increase the probability of identifying that item as the correct item in the future. 
Presumably the right item will be rehearsed much more than the wrong item.
Hence, a differential number of frequency units begins to build up in connection
with the right and wrong items. 
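
The bookkeeping that frequency theory proposes can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the unit weights (one unit for perceiving, one for pronouncing, two for rehearsing), the tie-breaking rule, and the sample words are hypothetical and are not taken from Ekstrand, Wallace, and Underwood (1966).

```python
from dataclasses import dataclass

# Assumed unit weights; the theory as described does not specify numbers.
PERCEIVE_UNITS = 1   # added to BOTH items when the pair is seen
PRONOUNCE_UNITS = 1  # added to whichever item the subject says
REHEARSE_UNITS = 2   # added to the right item during the feedback interval


@dataclass
class Pair:
    """One verbal-discrimination pair with running frequency counts."""
    right: str
    wrong: str
    right_units: int = 0
    wrong_units: int = 0


def run_trial(pair: Pair) -> bool:
    """Run one VD trial on a pair; return True if the choice was correct."""
    # 1. Perceiving: both members of the pair gain units.
    pair.right_units += PERCEIVE_UNITS
    pair.wrong_units += PERCEIVE_UNITS

    # 2. Choosing: pick the item with more units. On a tie this sketch
    #    pessimistically picks the wrong item (a stand-in for guessing).
    chose_right = pair.right_units > pair.wrong_units

    # 3. Pronouncing: the chosen item gains units.
    if chose_right:
        pair.right_units += PRONOUNCE_UNITS
    else:
        pair.wrong_units += PRONOUNCE_UNITS

    # 4. Rehearsing during feedback: the right item gains units.
    pair.right_units += REHEARSE_UNITS
    return chose_right


pair = Pair(right="table", wrong="river")  # hypothetical items
outcomes = [run_trial(pair) for _ in range(3)]
print(outcomes, pair.right_units, pair.wrong_units)
```

With these assumed weights the first choice is wrong, but rehearsal during feedback tips the balance, and the gap in frequency units between the right and wrong items widens on every later trial.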

RESPONSE DIFFERENTIATION 

Discrimination, as we have seen, refers to the situation wherein the subject limits 
a given response to one of many similar stimuli. Response differentiation, in
contrast, refers to the situation in which the subject comes to give one of several 
similar responses to a specific stimulus situation. For example, when a pigeon is 
first reinforced for pecking a key, its pecking behavior may be somewhat varied 
and haphazard. It pecks here and there; some pecks are weak, and others are 
strong. As training progresses, the responses become more consistent. The pecking
behavior is shaped to the point at which it maximizes reinforcement. The
pigeon differentiates between more and less successful responses and restricts 
itself to those that are maximally efficient. When babies first begin to speak, 
their words are poorly framed. Parents praise them when their responses are 
accurate and withhold reinforcement when they are not. The responses gradually 
become differentiated. Poorly vocalized instances drop out. The probability of 
clearly enunciated sounds, or those that will most likely result in reinforcement, 
increases. 

By and large, more experimental work has been done with discrimination than 
with response differentiation. But there are some available examples of response 
differentiation in the literature. The concept of shaping, wherein the experimenter 
gradually induces the subject to emit a particular response by reinforcing successive
approximations to that response, has already been discussed. Verbal conditioning
represents another area of research that encompasses the concept of
response differentiation. In a typical verbal-conditioning experiment (Greenspoon,
1955) the subjects are asked to say words as they come to mind. As the
subjects respond, the experimenter murmurs “mm-hmm” (or some other subtle 
form of reinforcement) each time they say a plural word. Control subjects do not 
receive this form of social reinforcement. Greenspoon found evidence for an 
increase in the rate of plural word production in the experimental group. In other 
words, the subjects were able to differentiate between responses that were effective
in producing a reinforcement (plural) and those that were not effective
(singular). Other varieties of verbal conditioning (for example, Taffel, 1955) have 
required the subjects to make up sentences using one of six pronouns. The 
experimenter reinforced the choice of one arbitrarily designated pronoun and did
not respond when one of the remaining pronouns was used. Taffel found that the 
use of the reinforced pronoun increased during the experiment. 

The controversy that has swirled about the verbal-conditioning effect has to do 
with the subjects’ awareness of what is going on in the experiment. Some authors 
(Spielberger, 1965) suggest that subjects who exhibit the verbal-conditioning 
effect are usually aware of the conditions of the experiment. They know, or have 
discovered, that the experimenter is reinforcing a particular kind of response. 
These authors also suggest that subjects who are unaware of the reinforcement 
contingency usually fail to condition. Other investigators, however, appear to 
have demonstrated verbal conditioning in the absence of awareness (see Kennedy, 
1971). Rosenfeld and Baer (1969) ran an experiment in which the experimenter
was really the subject and the subject was really the experimenter. One individual
was informed that he was to be an experimenter. During an interview with a 
subject he was to reinforce the subject’s chin-rubbing response by nodding his 
head each time it occurred. In reality, this experimenter was the subject and the 
subject was the experimenter. The presumed subject (the true experimenter) was 
in league with Rosenfeld and Baer. During the interview the true experimenter 
was actually trying to condition the true subject. Each time the true subject said 
“yeah,” the true experimenter reinforced him by rubbing his chin. The results 
indicated that the true subject increased the rate at which he said “yeah” even 
though he thought he was the “experimenter” and in control of the situation. The 
true subject apparently had no idea that he was being conditioned, and yet his 
response rate went up. 

With this classic example of devious psychological experimentation we come to 
the end of our discussion of stimulus generalization, discrimination, and the 
related process of response differentiation. In the next section we shall move on to 
another related topic: transfer of training. We shall concentrate on the impact
that a given instance of learning may have on subsequent instances of learning. 
The overlap between transfer effects and generalization should become apparent 
as we trace through positive and negative transfer phenomena. 

TRANSFER OF TRAINING 

The effects of prior training on present training may be either positive or negative. 
For example, bad study habits acquired during high school may have a negative 
influence on attempts to establish better study habits in college. Learning one 
language may facilitate the acquisition of subsequent languages. Learning to drive 
a car certainly aids in learning to drive a truck. Learning to respond fearfully in 
one social setting may hinder attempts to adjust comfortably in a new set of social 
circumstances. Such transfer of training is discussed here because, as we shall see, 
it is closely related to stimulus generalization. Specifically, both transfer of 
training and stimulus generalization are concerned with the effects of prior 
learning on subsequent performance. 

An enormous body of literature has grown up around the concept of transfer of 
training. The basic transfer experiment involves two stages. The subject first 
learns one task (quite often a paired-associate list) and then attempts to master a 
second somewhat similar task. Within this simple, convenient framework any 
number of variables can be manipulated. Historically, the most important variable
has been that of the similarity between the successive tasks. One of the
primary concerns of this section will be to outline the complex relationship 
between similarity and transfer. 

Transfer Designs: Specific and Nonspecific Transfer 
Suppose that we are interested in what effects the learning of one paired-associate 
list has on the subsequent learning of a second list. Furthermore, suppose that the 
two lists compose what is commonly known as the A-B, A-D paradigm. What
this means is that the stimuli in the two successive lists (the A units) are identical, 
but the responses in the two successive lists (the B and D units) are different. One 
item from the first list might be valiant-rotten, and the corresponding pair in the
second list might be valiant-complex. The subjects are required to associate
rotten with valiant in the first list and then to associate complex with the same
stimulus term during second-list learning. We want to know if A-D learning will
be helped or hindered by the A-B experience. On an intuitive basis we might
suspect that the A-B learning would interfere with, or compete with, the subjects’
attempts to connect D to A. Each time the subjects try to come up with complex
during A-D learning, they are hampered by the fact that rotten, so recently
acquired, continues to pop into their mind.

Table 7.1 Transfer Designs

                              FIRST TASK    SECOND TASK
Design 1
  Experimental condition      A-B           A-D
  Control condition           None          A-D
Design 2
  Experimental condition      A-B           A-D
  Control condition           B-C           A-D

Intuitive notions aside, we have to find a way to measure the impact of A—B 
learning on A-D learning. We need a control group that learns the A-D but not 
the A—B list. This pattern is designated as Design 1 in Table 7.1. We can compare 
the ease with which the experimental and control groups master the A—D list. 
Typically, the experimental groups have a more difficult time of it in this particular
situation, confirming our intuitive suspicion that the A-B, A-D paradigm is a
highly negative transfer paradigm, or a paradigm in which the learning of the 
second task is hindered by the learning of the first task. It is difficult to connect Ds 
to As after Bs have already been associated with As. 

It is at this juncture that the first of many complications arises. There are at 
least two transfer factors that contribute to the experimental subjects’ ability to 
master the A—D list. The first, as we have noted, has to do with the fact that the B 
responses probably compete with the D responses during A—D learning. This 
influence is termed specific transfer. It refers to the impact of specific first-list 
associations on attempts to establish specific second-list associations. In this 
particular paradigm the specific transfer effect is a negative, or disruptive, one. (In 
other situations, as we shall see, specific transfer effects may be positive.) But 
there is another type of transfer factor that contributes to the experimental 
subjects’ ability to acquire the A—D list. It is referred to as nonspecific transfer and 
is, in this particular paradigm, a positive influence. What is it about this Design 1 
that would give the experimental subjects an advantage over the control subjects 
in learning the A—D list? 

LEARNING TO LEARN. There are two nonspecific transfer factors that give the 
experimental subjects an advantage over the control subjects during A-D learning.
The experimental subjects have an advantage in that they have already
learned one paired-associate list, whereas the control subjects have not. The
experimental subjects have undoubtedly already learned something about how to 
master a paired-associate list. They already know what to look for, what 
strategies to use, and how to hook up items in efficient ways. 

Figure 7.9 The learning-to-learn effect. Subjects learned sixteen successive lists, one per
day. Notice that the last list was learned in about half as many trials as the first list.
(Adapted from Ward, L. B. Reminiscence and learning rate. Psychological Monographs,
1937, 49, (220). Copyright 1937 by the American Psychological Association.)

The importance and power of learning to learn have been known for a long 
time. In a very early study, for example, Ward (1937) had subjects learn 16 lists, 
one on each of 16 successive days. As you can see in Figure 7.9, the subjects 
showed an enormous improvement in their ability to learn the successive lists. 
Whereas it took them almost 40 trials to learn the first list, they were able to learn 
the last, or 16th, list in fewer than 20 trials. Now that is learning to learn (see also 
Keppel, Postman, & Zavortink, 1968). 

WARM-UP. The second nonspecific transfer effect is called warm-up. At the time 
that experimental subjects in Design 1 of Table 7.1 begin A—D learning, they are 
already “warmed up” when compared with the control subjects. 

At the beginning of A—D learning the experimental subjects will not be as 
distracted as the control subjects. They have already established sensory, postural, 
and attitudinal sets to learn paired-associate lists. They are ready, whereas the 
control subjects must settle in. 

Learning to learn can be distinguished from warm-up in terms of its temporal 
persistence. Warm-up effects are presumed to be transitory. Once practice ceases, 
the advantage gained by having “warmed up” dissipates rapidly. The effects of 
learning to learn are, in contrast, assumed to be more permanent. Once subjects 
have developed efficient strategies and methods for mastering a particular learning
task, regardless of the specific content of the task, these strategies and
methods may be maintained, or remembered, over time. 

ELIMINATING NONSPECIFIC TRANSFER. To return to our consideration of the 
A—B, A—D paradigm, it is clear that two transfer influences operate in the 
experimental condition. One is negative specific transfer, associated with
competition between the A-B and A-D associations. The other is positive nonspecific
transfer, identified with warm-up and learning to learn. Overall transfer in Design 
1 will be determined by both types of transfer. 

But suppose that we are interested only in specific transfer. We want to 
estimate it accurately, and we want our estimate to be free of nonspecific transfer 
effects. If we were to use Design 1, we would underestimate the amount of 
negative specific transfer. The negative specific transfer effect would be partially 
mitigated by the advantage that the experimental group gains over the control 
group through positive nonspecific transfer. Hence, we need an alternative design 
in which the control subjects are given an equal opportunity to warm up and 
learn to learn. Design 2 in Table 7.1 represents such a design. The control group 
first learns a list that is entirely unrelated to the A—D list. Hence, the control 
condition in this design is free from any specific transfer. But it does involve 
nonspecific transfer. The control subjects are allowed the opportunity to warm up 
and learn to learn. In Design 2 the control and experimental conditions are 
equated in terms of nonspecific transfer. They differ only in terms of specific 
transfer. Thus, any difference in the ease with which the experimental and control 
subjects learn the A—D list must result from specific transfer alone. The design is 
free from contamination by unwanted nonspecific factors (see also Postman, 
1969, 1971; Postman & Schwartz, 1964; Thune, 1951).
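
The logic of comparing the two designs can be made concrete with a small arithmetic sketch. The percentage-transfer formula used here, (control - experimental) / control, is one common convention for trials-to-criterion measures and is an assumption on our part, not something specified in the text. The trial counts are invented purely for illustration.

```python
def percent_transfer(control_trials: float, experimental_trials: float) -> float:
    """Percentage transfer on a trials-to-criterion measure.

    Positive values mean the experimental group needed fewer trials than
    the control group (positive transfer); negative values mean it needed
    more (negative transfer).
    """
    if control_trials <= 0:
        raise ValueError("control_trials must be positive")
    return 100.0 * (control_trials - experimental_trials) / control_trials


# Hypothetical mean trials to criterion on the A-D list.
EXPERIMENTAL = 22.0      # learned A-B first
CONTROL_DESIGN_1 = 20.0  # no first task (Design 1 control)
CONTROL_DESIGN_2 = 16.0  # unrelated first list (Design 2 control)

# Design 1 mixes negative specific and positive nonspecific transfer.
design_1 = percent_transfer(CONTROL_DESIGN_1, EXPERIMENTAL)

# Design 2 equates nonspecific transfer, leaving specific transfer alone.
design_2 = percent_transfer(CONTROL_DESIGN_2, EXPERIMENTAL)

# Benefit of warm-up and learning to learn, seen by comparing the controls.
nonspecific = percent_transfer(CONTROL_DESIGN_1, CONTROL_DESIGN_2)

print(design_1, design_2, nonspecific)
```

In this made-up case Design 1 shows only mild negative transfer, whereas Design 2 reveals a much larger negative specific effect, because the Design 2 control group also benefits from warm-up and learning to learn.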

Specific Transfer: Similarity Effects 

Having outlined the manner in which psychologists isolate specific from nonspecific
transfer, we now turn to a more detailed consideration of some of the more
important specific transfer phenomena. The A—B, A—D paradigm just discussed is 
only one of many possible transfer patterns. For example, we can construct 
successive lists so that they conform to an A—B, A—B' pattern. In this paradigm 
the stimuli in successive lists are identical, and the responses are similar (B' is a 
unit that is similar or related to B). A pair from the first list might be valiant-scared,
whereas the corresponding A-B' pair would be valiant-frightened. If the
similarity between the first-list and second-list responses is strong enough, this 
paradigm tends to yield positive transfer. When subjects are attempting to connect
frightened to valiant, the stimulus term valiant might cue off the already
learned scared response, which, in turn, might stimulate frightened. 

We can also construct an A-B, A'-B paradigm in which the responses in the 
successive lists are identical and the stimuli are similar. This also tends to be a 
positive transfer paradigm. Then there is the A—B, C—B paradigm, in which the 
responses are identical and the stimuli are unrelated. The more patterns we devise 
and test, the more confusing the situation becomes, and the more difficult it 
becomes to keep them all straight in one’s mind (see Table 7.2). 
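
One way to keep the paradigms straight is to note that each is fully described by two relations: how the second-list stimuli relate to the first-list stimuli, and how the second-list responses relate to the first-list responses. The sketch below applies that idea to the sample items of Table 7.2. The function names and the hand-coded set of "similar" response pairs are hypothetical conveniences; no claim is made that similarity is measured this way in practice.

```python
# Sample items from Table 7.2 as (stimulus, response) pairs.
FIRST_LIST = [("LOH", "tranquil"), ("TUN", "afraid"),
              ("BAV", "insane"), ("JAX", "complete")]

SECOND_LISTS = {
    "C-D": [("GOZ", "royal"), ("WIF", "barren"),
            ("DEX", "spoken"), ("PEC", "double")],
    "A-D": [("LOH", "royal"), ("TUN", "barren"),
            ("BAV", "spoken"), ("JAX", "double")],
    "A-B'": [("LOH", "calm"), ("TUN", "fearful"),
             ("BAV", "crazy"), ("JAX", "total")],
    "C-B": [("GOZ", "tranquil"), ("WIF", "afraid"),
            ("DEX", "insane"), ("PEC", "complete")],
}

# Assumed similarity judgments, coded by hand for this illustration.
SIMILAR = {frozenset(p) for p in [("tranquil", "calm"), ("afraid", "fearful"),
                                  ("insane", "crazy"), ("complete", "total")]}

# (stimulus relation, response relation) -> second-list label.
PARADIGM = {
    ("identical", "identical"): "A-B",
    ("identical", "similar"): "A-B'",
    ("identical", "unrelated"): "A-D",
    ("unrelated", "identical"): "C-B",
    ("unrelated", "unrelated"): "C-D",
}


def relation(old: str, new: str) -> str:
    """Describe how a second-list item relates to its first-list counterpart."""
    if old == new:
        return "identical"
    if frozenset((old, new)) in SIMILAR:
        return "similar"
    return "unrelated"


def classify(first, second) -> str:
    """Label the second list, comparing pairs position by position."""
    stim = {relation(a, c) for (a, _), (c, _) in zip(first, second)}
    resp = {relation(b, d) for (_, b), (_, d) in zip(first, second)}
    if len(stim) != 1 or len(resp) != 1:
        raise ValueError("pairs in the list do not share a single paradigm")
    return PARADIGM[(stim.pop(), resp.pop())]


for label, second in SECOND_LISTS.items():
    print(label, "->", classify(FIRST_LIST, second))
```

Each sample list is recovered under its own label, which is simply a check that the two-relation description is enough to tell these four paradigms apart.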

OSGOOD’S TRANSFER SURFACE. Fortunately, Osgood (1949) has developed a 
mnemonic device that summarizes the transfer effects expected in many paradigms.
The device is neither perfect nor comprehensive. But it is convenient, and it
does provide us with a simple method for keeping the welter of transfer paradigms
and similarity relationships somewhat organized in our mind. A modified
version of the original Osgood surface is shown in Figure 7.10. Consider the plane 
depicted in the left side of the figure. Degrees of similarity between first-list and 


Table 7.2 Sample Items Representing Four Basic Two-List Transfer Paradigms

FIRST LIST      SECOND, OR TRANSFER, LIST
(A-B)           C-D           A-D           A-B'          C-B

LOH-tranquil    GOZ-royal     LOH-royal     LOH-calm      GOZ-tranquil
TUN-afraid      WIF-barren    TUN-barren    TUN-fearful   WIF-afraid
BAV-insane      DEX-spoken    BAV-spoken    BAV-crazy     DEX-insane
JAX-complete    PEC-double    JAX-double    JAX-total     PEC-complete


second-list stimuli, ranging from identity to neutrality (unrelated), are arranged 
along the width of the plane. Degrees of response similarity, ranging from identity 
to neutrality, are spread along the length of the plane. (Actually, the Osgood 
surface extended the response dimension to include opposed and antagonistic 
responses, but neither extension has proved to be of much value.) The plane 
contains points that represent all possible combinations of stimulus and response 
similarity. The locations of a few of the more important paradigms are indicated
on the plane. For example, the A-B, C-B paradigm is located in the far left
corner, where response similarity is at a maximum (identity), and stimulus similarity
is minimal. The A-B, A-B paradigm (identical lists) is located in the near
left corner, and so on. 


Left side: As stimulus similarity increases with identical responses, transfer increases from 
zero to maximum positive. 

Right side: As stimulus similarity increases with unrelated responses, transfer shifts 
from zero to maximum negative. 

Back side: As response similarity increases with unrelated stimuli, transfer remains at zero. 

Front side: As response similarity increases with identical stimuli, transfer shifts from 
maximum negative to maximum positive. 


Figure 7.10 A modified version of the Osgood transfer and retroaction surface. (After 
Osgood, C. E. The similarity paradox in human learning: A resolution. Psychological
Review, 1949, 56, 132-143. Fig. 5, p. 140. Copyright 1949 by the American
Psychological Association.)

Now, having located all possible combinations of stimulus and response
similarity on a plane, we may ask the next question: How much and what kind of
transfer (positive or negative) do we expect in each of these paradigms? To 
answer this question we turn to the right side of Figure 7.10. Here we find a flexed 
surface that is superimposed on and that cuts through our plane at the dotted line. 
This surface will indicate the kind of transfer that we expect in any given 
paradigm. Notice that the surface is above the plane at its left end and below the 
plane at its right end. The vertical dimension represents the direction and amount 
of expected transfer. If the surface is above the plane, we expect positive transfer 
in those paradigms lying under the surface. If the surface is below the plane, we 
expect negative transfer. 

Suppose that we wish to predict transfer in the A-B, A-D paradigm. First, we 
locate the A-B, A-D paradigm on the plane. Second, we draw a line, perpendicular
to the plane, from the plane to the surface. The direction of the line represents
the direction of the expected transfer. A line drawn upward represents positive 
transfer; a line drawn downward represents negative transfer. The length of the 
line represents amount of transfer. The longer the line, the stronger is the transfer 
effect. Thus, we expect a large amount of negative transfer in the A-B, A-D 
paradigm. We must draw a long line downward to reach the surface from the 
plane at point A-B, A-D. Try predicting transfer in the A-B, A'-B paradigm. 
First locate the paradigm on the plane. Then draw a line from the plane to the 
surface at the point. The surface predicts moderate positive transfer in this 
paradigm. 

This is an interesting and important model. Once one grasps it, it serves as a 
convenient summary device. What it says in words is this: When responses are 
identical and stimulus similarity increases, then transfer will increase from zero to 
maximum positive (this is the function represented by the left edge of the surface). 
When responses are unrelated and stimulus similarity increases, then transfer will 
increase from zero to maximum negative (the right edge of the surface). When 
stimuli are unrelated and response similarity increases, transfer remains at zero 
(the back edge). When stimuli are identical and response similarity increases, 
transfer shifts from maximum negative to maximum positive (the front edge). 
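
The four edge statements can be captured by a deliberately crude numerical sketch. It assumes that both similarity dimensions are scaled from 0 (unrelated) to 1 (identical), that transfer runs from -1 (maximum negative) to +1 (maximum positive), and that the surface between the edges is a straight-line interpolation. None of these scalings comes from Osgood (1949), whose surface is curved; the function only reproduces the verbal summary at the edges.

```python
def expected_transfer(stimulus_similarity: float, response_similarity: float) -> float:
    """Crude stand-in for the transfer surface.

    Both arguments run from 0.0 (unrelated) to 1.0 (identical). The return
    value runs from -1.0 (maximum negative transfer) to +1.0 (maximum
    positive transfer). The linear form is an assumption of this sketch.
    """
    for value in (stimulus_similarity, response_similarity):
        if not 0.0 <= value <= 1.0:
            raise ValueError("similarity values must lie between 0 and 1")
    # With unrelated stimuli the product is zero whatever the responses are;
    # with identical stimuli it sweeps from -1 to +1 as responses converge.
    return stimulus_similarity * (2.0 * response_similarity - 1.0)


a_b_a_d = expected_transfer(1.0, 0.0)        # identical stimuli, unrelated responses
a_b_a_b = expected_transfer(1.0, 1.0)        # identical lists
a_b_c_b = expected_transfer(0.0, 1.0)        # unrelated stimuli, identical responses
a_b_a_prime_b = expected_transfer(0.5, 1.0)  # similar stimuli, identical responses
print(a_b_a_d, a_b_a_b, a_b_c_b, a_b_a_prime_b)
```

Under these assumptions the A-B, A-D paradigm sits at maximum negative transfer, identical lists sit at maximum positive transfer, A-B, C-B sits at zero, and A-B, A'-B with moderately similar stimuli falls at a moderate positive value, in line with the verbal description.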

The overall surface has received some experimental support. Dallett (1962) 
chose 12 points on the surface representing three degrees of response similarity 
and four degrees of stimulus similarity. He constructed lists corresponding to 
these 12 paradigms and tested for transfer. His results support the overall shape 
and nature of the hypothetical surface. 

Serious objections to the surface have been raised, however. The most important
of them has to do with the impact of meaningfulness on transfer. It seems
that the surface is most suitable as a summary statement of the transfer effects 
expected when highly meaningful materials are employed. It does not accurately 
predict the amounts of transfer that will be obtained under conditions of low 
meaningfulness (Dean & Kausler, 1964; Jung, 1963). Thus, our use of the Osgood 
surface must be tempered by an awareness of its limitations. 

Generalization and Transfer of Training 
We have repeatedly noted that transfer and generalization are related in that both 
are concerned with the effects of prior learning on subsequent performance. In
spite of the fact that the two fields have developed somewhat independently (for
example, different variables, designs, methods, vocabularies, and investigators), 
psychologists have taken note of the basic correspondence between the two 
phenomena. 

Consider stimulus generalization. We train a response to a given stimulus. We 
then present similar stimuli and observe whether the response occurs. The A-B, 
A'-B transfer paradigm, in which stimulus similarity is varied, seems to be related 
to stimulus generalization. We connect B to A and then present a similar stimulus 
(A'). The more similar A' is to A, the more likely A' is to elicit B. We can label this
effect either stimulus generalization or positive transfer. Variations in interlist 
stimulus similarity correspond to variations in the similarity between the training 
and the test stimuli in a typical stimulus generalization study. An awareness of 
this basic continuity formed the basis for Gibson’s (1940) classic analysis of 
verbal transfer effects in terms of the principles of stimulus generalization. 

But a consideration of stimulus generalization is not enough to account for all 
our observed transfer-of-training phenomena. For example, what about the A—B, 
A-B' paradigm? This does not seem to involve stimulus generalization, but it 
does involve another type of generalization, which we can term response generalization.
It was left to Osgood (1946, 1949, 1953) to mesh, or integrate, the
concepts of stimulus generalization and response generalization in his analysis of
transfer effects. Very simply, response generalization refers to the fact that, if a 
given response is connected to a given stimulus, that stimulus will also tend to 
elicit similar responses. The less similar the response, the less likely it will be to 
occur. For example, if a girl’s name is Melodie, then there will be a tendency for 
people to call her Melanie but not much of a tendency to call her Joan. If we 
connect the word house to a given stimulus in the laboratory, then that stimulus
will, in one way or another, also tend to elicit home. To return to the thread of the
analysis, the A-B, A-B' transfer paradigm seems to involve varying degrees of
response similarity. If we connect house to a given stimulus and then require
subjects to associate home with the same stimulus, we will observe a positive
transfer effect brought about by response generalization. 

Osgood’s surface represents an integrated estimation of the effects of both 
stimulus and response generalization in transfer-of-training paradigms. It is largely
through his efforts that the concepts of stimulus and response generalization
were extended into the field of verbal learning, where they contributed to the 
theoretical analysis of transfer-of-training effects. 

You should not be misled by the fact that generalization and transfer of 
training are often treated as separate topics within the field of psychology. Their 
intimate relationship is discussed in more detail by Postman (1971). 


SUMMARY 

1. Generalization refers to the fact that, if a given response is connected to a 
particular stimulus, similar stimuli will tend to elicit that same response. The 
greater the similarity, the greater is the generalization. 

2. Some studies show that the generalization gradient will become more peaked 
as the number of reinforced responses to the training stimulus is increased; 
others show no change or actual flattening of the gradient.

3. Generalization tends to be greater following partial reinforcement than after
continuous reinforcement. However, this effect becomes less clear when one 
tests for generalization during the intervals between reinforcements. 

4. Increased drive level may lead to less generalization in a simple task and more 
generalization in a complex task. 

5. Prior experience with the stimulus dimension tends to result in steeper 
generalization gradients. 

6. Primary generalization involves some innate stimulus dimension. Secondary 
generalization involves an acquired, or learned, stimulus dimension. 

7. In semantic conditioning a response is first connected to a given word. It is 
then found that similar words tend to elicit that same response. 

8. In the acquired equivalence of cues paradigm two distinct stimuli are paired. 
A response is connected to one of the two. It is then found that the remaining 
stimulus will tend to elicit that same response. 

9. Tendencies not to respond can generalize as well as tendencies to respond. 

10. Generalization is important in applied fields such as behavior modification. 

11. Discrimination refers to an increase in the tendency to restrict a response to 
one stimulus value along a stimulus dimension. It is brought about by 
differential reinforcement. Stimulus generalization and discrimination may 
well be opposite sides of the same coin. 

12. Many think of generalization as imprecise stimulus control, whereas discrimination
refers to precise stimulus control.

13. In discrimination learning, responses to both the positive and negative
stimuli initially increase. Then the response to the negative stimulus drops out, and
the response to the positive stimulus increases. 

14. In the simultaneous presentation technique both the positive and the negative
stimuli are presented together. The successive presentation technique
involves the sequential presentation of the positive and negative stimuli. 

15. According to the Hull-Spence theory of discrimination learning, the net
probability of responding to any point along a dimension is reflected by the
algebraic sum of the values of the generalized positive and negative
tendencies at that point. 

16. The relational interpretation argues that in solving a discrimination problem 
subjects are learning a relation between the stimuli and that they respond on 
the basis of that perceived relationship. 

17. Transposition refers to the fact that, if subjects learn to discriminate between, 
say, a large positive stimulus and a smaller negative stimulus and are then 
presented the large positive stimulus and a still larger new test stimulus, they 
will tend to respond to that new stimulus rather than to the old positive one. 

18. The relational theoreticians argued that the transposition effect proved that 
subjects learn a relation between the positive and negative stimuli. But 
Spence was able to predict the same transposition effect using the
Hull-Spence model. 

19. Noncontinuity theory argues that discrimination is the result of hypothesis 
testing. 

20. If a pigeon is first reinforced for pecking either of two keys and then one key 
is made the positive stimulus (reinforcement is continued) while the other key 
becomes the negative stimulus (reinforcement is discontinued), the response 
to the positive stimulus increases even though its treatment is constant across 
the two phases of the experiment. This is called behavioral contrast. It may 
result from a motivating frustration factor. 

21. In errorless discrimination learning a firm response to the positive stimulus 
alone is established. The negative stimulus is then introduced into the
situation so gradually that the animal never responds to it. 

22. In insoluble discrimination situations the stimuli are made so similar or the 
reinforcement is presented so randomly that the subject cannot solve the 
problem. Behavior disturbances, which have been likened to human neurotic 
and psychotic behaviors, have been observed in animals in these situations. 

23. Although in this chapter we concentrate on human verbal discrimination 
processes, humans’ ability to discriminate and process visual material is also 
refined, and it will be discussed in a later chapter. 

24. The stage analysis of verbal learning suggests that a simple S-R association 
involves at least three distinct subprocesses, or stages—stimulus learning, 
response learning, and associative learning. Each stage appears to involve 
discrimination processes. 

25. In verbal-discrimination learning (VD) subjects are presented long series of 
pairs of items over and over. The subjects’ task is to identify which member 
of each pair is the “correct” item (as arbitrarily chosen by the experimenter). 
This paradigm represents a direct extension of the discrimination paradigm 
into the field of verbal training. 

26. The frequency theory of VD learning argues that the discrimination between 
“right” and “wrong” items is made on the basis of the relative frequency of 
the members of each pair. The more frequency units an item has acquired, 
the more likely it is to be labeled as the correct item. Perceiving responses, 
pronouncing responses, and rehearsing responses all add frequency units to 
an item. 

27. Frequency theory accounts for many of the data in VD learning. 

28. Response differentiation refers to the process wherein subjects refine their 
response down to the point at which it yields reinforcement with a minimum 
of effort. 

29. Verbal conditioning represents an example of response differentiation.
Subjects are asked to say words as they come to mind. The experimenter may 
increase the occurrence of certain types of word (for example, plural nouns) 
by reinforcing such words with a murmured “mm-hmm.” 

30. A controversy exists over whether such verbal conditioning can occur when 
the subjects are unaware of the reinforcement and its relationship to the 
critical response class. 

31. The study of transfer of training focuses on the positive and negative effects 
of prior training on subsequent training. 

32. Two general classes of transfer event are referred to as specific and
nonspecific transfer. Specific transfer refers to the impact of specific first-list
associations on attempts to establish specific second-list associations.
Nonspecific transfer refers to transfer resulting from such factors as learning to
learn and warm-up. These factors are independent of the specific contents of
the lists. Warm-up effects are presumed to be transitory, whereas
learning-to-learn phenomena are more permanent. 

33. The similarity between successive lists has proved to be one of the more 
powerful transfer variables. The relationship between transfer and interlist 
similarity is not simple. 

34. Osgood has developed a transfer surface that summarizes the direction and 
amount of transfer expected in transfer situations. 

35. The surface predicts that, when responses are identical and stimulus
similarity increases, then transfer will increase from zero to maximum positive. 
When responses are unrelated and stimulus similarity increases, transfer will 
shift from zero to maximum negative. When stimuli are unrelated and 
response similarity increases, then transfer remains at zero. When stimuli are 
identical and response similarity increases, then transfer will shift from 
maximum negative to maximum positive. 

36. With a few exceptions the surface summarizes transfer effects obtained with 
highly meaningful, or familiar, materials. But when materials of a low level of 
meaningfulness are employed, the surface is not accurate. 

37. The relationship between transfer of training and generalization is a close 
one. Transfer-of-training paradigms involve both stimulus and response 
generalization. For example, the A-B, A'-B paradigm involves interlist
stimulus generalization, whereas the A-B, A-B' paradigm involves response 
generalization. 

PART FOUR

Retention 


In this part we focus on retention processes. Once again it should be
remembered that it is impossible to completely separate acquisition, transfer,
and retention. A concern for one of these three is a matter of emphasis rather
than of clear and distinct differences among them. Part Four concentrates on
what happens to memory traces after learning has been accomplished. Important
points of contact between the study of human and animal memory are stressed
toward the end of the part. 

The field of memory and retention is enormous, and in a text such as this we
must be selective. Sometimes the field is organized around different memory
tasks. Some authors consider, in a sequential fashion, theory and data
associated with such tasks as paired-associate learning, serial learning, free
recall, and recognition learning. Each of these tasks is then subdivided; for
example, we may consider free recall, in which the items are presented for
either a single trial or for multiple trials. The organization assumed in this
text is a bit different. We shall focus on alternative theoretical positions
and issues rather than on tasks per se; data will be drawn from the various
task situations when they are relevant to a particular theoretical issue. In
Chapters 8 and 9, for example, we cover six major interpretations of memory. 

Chapter 8 begins with a discussion of interference. The basic assumption
behind interference theory is that retention losses can be traced to the
interfering effects of previous or subsequent learning. For example, if we
learn a phone number such as 872-6314, we will have little trouble remembering
it over time. But if after processing this number we go on to learn 843-1762,
782-6413, 287-1463, and 378-2614, we may then have trouble recalling the
original number. In other words, the subsequent learning somehow interferes
with our ability to recall the original number. In the latter part of Chapter 8
we move on to a discussion of decay theory and close with a consideration of
the consolidation hypothesis. 

Chapter 9 begins with a look at separate-store models. This approach draws
heavily on information-measurement concepts and the language of digital
computers. The individual is seen as an information-processing system. Memory
is thought of in terms of the encoding, storage, and retrieval of information.
In contrast to the single theory associated with the interference approach,
many information-processing models of memory are available. We examine several
of them. We then consider the concept of levels of processing and a number of
recent semantic-network models. 

Chapter 10 begins by noting that psychologists have distinguished among many
types of memory. We ask whether these distinctions are valid and whether
distinct models will be necessary for the proposed types of memory. Several
important distinctions are discussed (for example, short-term versus long-term
memory, recall versus recognition, verbal versus visual processing, the
processing of alternative forms of information, and animal versus human
memory). By drawing on available experimental data we attempt to assess the
validity of these various distinctions. 

Chapter 11 considers the role of organization and elaboration in improving
memory. Organization in memory refers not so much to a particular model as to
an orientation toward the study of memory. The basic assumption behind
organization research is that we remember materials by relating them to one
another, by building “mental structures,” and by ordering materials in many
different ways. In a sense the organizational approach represents a return to
an awareness of the complexity of cognitive events. New methods of assessing
organization have stimulated this return to some very interesting problems. We
also discuss several lines of research (for example, encoding, mnemonics, and
imagery) and attempt to relate them to issues of general concern in the field
of memory. 

INTRODUCTION 

Efforts to understand memory have been sprightly and active for many years. 
Although no single model or theory has proven to be completely adequate, great 
strides have been made. In this and the next chapter we shall examine six major 
approaches to the understanding of memory. Interference, decay, and
consolidation theories are discussed in this chapter, and separate-store models, levels of 
processing, and semantic-network models are considered in Chapter 9. We begin 
now with some of the aspects of interference. 

PROACTIVE INHIBITION 

Suppose that you are invited to learn an ordinary 10-item paired-associate list. 
After mastering the list completely you are dismissed with instructions to return 
at the same time the next day. When you return, the experimenter presents the 
stimuli one at a time with instructions to recall the correct responses. You find 
that you can remember only 8 of the 10 responses. 

You have forgotten 20% of the material in 24 hours. Where did it go? What 
happened to that material that you knew so well so recently? These questions 
form the basis of the investigation of memory. We want to know what happened 
to the other two items and why they were forgotten. 

There is a powerful interpretation of forgetting that is best introduced by 
extending the experiment described above. Suppose that on Day 2, after you have 
recalled the list you learned on Day 1, the experimenter gives you another 
paired-associate list to learn. After you master this second list, she dismisses you 
with instructions to return the next day. On Day 3 you attempt to retrieve the 
second list. You then learn a third list, and so on. Let us say that you learn 20 
somewhat similar paired-associate lists on 20 successive days. On each day you 
are required to recall only the list you learned the previous day. In other words, 
the retention interval for any given list is 24 hours. The question is this: How 
much of the 20th list will you be able to recall 24 hours after it is learned? 

Some students suggest that recall of that final list will be almost perfect. You 
have, after all, had all sorts of practice in learning and recalling paired-associate 
lists. And we have already seen in the last chapter that learning to learn can be a 
powerful aid to performance on successive, similar tasks (see Figure 7.9). But that 
is not what happens at all. Recall of that final list is extremely poor. In fact, you 
probably will not be able to recall more than two or three of the items you learned 
just 24 hours ago. You have forgotten, or are unable to produce, 80% of the correct 
responses. That is a large effect, and we shall want to find out what causes it. 
Obviously, it must have something to do with all those prior lists you learned. The 
language usually used here suggests that the first 19 lists somehow interfere with, 
or compete with, attempts to recall the 20th list. The materials from all those 
other lists keep popping into your mind as you try to recall the 20th list. The old 
materials block the correct responses, or confuse you when you try to recall them. 

This effect is called proactive inhibition. It refers to a disruption in our ability 
to retrieve a given set of materials owing to the interfering effects of previously 
learned materials. The more previous lists we have learned, the greater the loss of 
retention. Figure 8.1 clearly indicates that recall of a given list over 24 hours 
declines as a function of the number of prior lists learned in the same experiment 
(Underwood, 1957). 

Figure 8.1 Proactive inhibition as a function of the number of previously learned lists. 
Each point represents the results of a different experimental study. (Adapted from 
Underwood, B. J. Interference and forgetting. Psychological Review, 1957, 64, 49-60. 
Fig. 3, p. 53. Copyright 1957 by the American Psychological Association. Reprinted by 
permission.) 


Table 8.1 contains the basic experimental design used to demonstrate proactive 
inhibition (PI). The critical comparison is between the experimental and control 
recall of Task 2. If the experimental subjects display poorer Task 2 recall, we say 
that PI has occurred. The recall of a given list (Task 2) has been disrupted by prior 
learning (Task 1). 
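
The comparison in this design can be made concrete with a little arithmetic. The sketch below is illustrative only: the function name and the sample scores are hypothetical, and expressing PI as a percentage of control recall is one common convention assumed here, not a formula given in the text.

```python
def percent_pi(control_recall, experimental_recall):
    """Proactive inhibition as a percentage of control-group recall.

    Both arguments are mean numbers of Task 2 items recalled on the
    retention test (Task 3). A positive result means the experimental
    group, which learned a prior list, recalled less than the control.
    """
    if control_recall <= 0:
        raise ValueError("control recall must be positive")
    return 100.0 * (control_recall - experimental_recall) / control_recall


# Hypothetical scores out of a 10-item list (not data from any study).
control = 8.0        # learned only list B
experimental = 5.0   # learned list A, then list B
print(percent_pi(control, experimental))  # 37.5
```

A result of zero would mean no measurable PI, and a negative value would correspond to proactive facilitation.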

Proactive inhibition is not a theory or an explanation. It is a fact, and an 
important one. It refers to the enormous amount of forgetting that can be 
attributed to the interfering effects of prior learning. The more we learn, or store, 
the more susceptible we are to this type of interference. The more organic 
chemistry we commit to memory, the greater is the probability of confusion. The 
more names and faces a professor attempts to match in his lecture section, the 
greater are the chances that he will have difficulty remembering additional 
name-face pairings. This is not to say that adding to our storage invariably leads
to poorer retrieval of newly acquired materials (see Postman & Gray, 1977). To 
the contrary, it is possible to overcome potential PI effects by ordering and 
choosing our materials so that they do not fall into an interfering pattern. In 
addition, we can build on, or structure, our knowledge so that we actually find 
proactive facilitation, or improved retention, rather than PI. And yet PI has to be
considered a major factor in many observed retention losses. Prior learning
constitutes a powerful source of interference (see Underwood, 1983).

Table 8.1 Proactive Inhibition Experimental Design

GROUP          TASK 1      TASK 2      TASK 3*
Experimental   Learns A    Learns B    Retention of B
Control        -           Learns B    Retention of B

*Task 3 measures PI. 

Is proactive inhibition the same thing as negative transfer? No, but students 
often confuse the two. As you will recall from the last chapter, negative transfer 
refers to what happens when the subject is learning the second list. PI refers to 
what happens during the recall of that second list at a later time, after some 
retention interval has elapsed. Negative transfer appears during Task 2 in Table 
8.1; PI appears during Task 3 in the same table. Negative transfer refers to the 
inhibiting effects that the learning of A has on the subsequent new learning of B. 
Proactive inhibition refers to the inhibiting effect that the learning of A has on the 
recall of B after some time has elapsed since the learning of B. 

Negative transfer can occur during a PI experiment (during the performance of 
Task 2 in Table 8.1), but the effects are different. Proactive inhibition involves 
three stages and recall of previously learned material, whereas negative transfer 
involves only two stages and involves new learning. 

RETROACTIVE INHIBITION 

There is another type of interference, called retroactive inhibition (RI).
Retroactive inhibition refers to interference resulting from subsequent learning. The basic 
RI design is depicted in Table 8.2. The experimental subjects learn two successive 
tasks. Then, at the end of some retention interval (typically ranging from none to 
several weeks) their ability to retrieve the Task 1 materials is tested. If their 
retention is poorer than that of the controls, we say that RI has been
demonstrated. If a 10-year-old learns numerous baseball statistics connected with the 
National League and then proceeds to do the same with American League 
statistics, then his ability to recall the National League statistics may well be less 
than if he had not acquired the American League materials. In situations like this 
retroactive-inhibition effects can be large. That is, a great percentage of our 
responses can be made unavailable through the manipulation of subsequent 
learning. 

Note that Table 8.2 suggests that the controls are resting, or doing nothing, 
while the experimental subjects are learning the second task. Control subjects are 
not actually allowed to rest during this interval. If they were, they would spend 
the time rehearsing Task 1, which would give them an advantage over the 
experimental subjects. During this interval, control subjects typically engage in 
some unrelated filler task, which is designed to prevent them from rehearsing 
Task 1. These filler tasks include such things as working perceptual-motor
puzzles, counting backward, solving simple mathematical problems, and crossing out 
designated digits on sheets of endless digits. 

Just as proactive facilitation can sometimes be observed in proactive
experiments, retroactive facilitation sometimes occurs in retroactive experiments. This
refers to the situation in which the materials are structured so that the learning of
that second task helps the subjects remember the first task. Although
retroactive-facilitation effects are easily demonstrated, our focus will be on interference 
effects because we are, after all, primarily interested in memory losses in this 
chapter. 

In summary, it seems that our ability to remember a given set of materials is
strongly affected by both prior and subsequent learning. Ironically, the very act of
learning itself constitutes a source of forgetting. Learning can, under various
circumstances, reduce our ability to remember what we have already learned, and
it has the potential to disrupt our ability to remember what we will learn in the
future. The exact nature of these disruptive processes will be the subject of an
upcoming section concerning interference theory.

Table 8.2 Retroactive Inhibition Experimental Design

GROUP          TASK 1      TASK 2      TASK 3*
Experimental   Learns A    Learns B    Retention of A
Control        Learns A    -           Retention of A

*Task 3 measures RI. 

INTERTASK SIMILARITY 

In the last chapter we discussed variations in stimulus and response similarity in 
two-list transfer situations. Osgood’s (1949) transfer surface summarized the 
various transfer paradigms. Exactly the same similarity variations can be
investigated in three-step RI and PI situations. For example, we can devise an RI
paradigm corresponding to the A-B, A-D pattern. The subjects learn two
successive lists (possessing identical stimuli and unrelated responses). They are then
tested for retention of the first, or A-B, list. Similarly, we might want to
investigate an A-B, A-B' proactive-inhibition situation in which the subjects are tested
for retention of the second list (A-B') after they have learned two lists that
possess identical stimuli and similar responses. We can vary the similarity of the
stimuli or the responses in any manner we wish. We can then test for either
first-list retention (RI) or second-list retention (PI). 

The close relationship between transfer paradigms and RI and PI designs 
should now be apparent. In transfer paradigms we focus on, or test, second-list 
learning, whereas in RI and PI paradigms we focus on, or test, retention of the 
materials over time. The PI and RI paradigms add a third step (the retention test) 
to the two-step transfer paradigms. Thus, transfer is involved in every PI and RI 
design. It is just that in PI and RI designs we do not focus on second-list learning. 
Rather, we are interested in what happens to the strengths of the materials after 
learning has been completed. For example, in an A—B, A—D transfer paradigm we 
look at the subjects’ performance during learning of the second list. In an A-B, 
A—D proactive-inhibition paradigm we test for A—D strength not while the 
subject is first learning A—D but after learning has been completed and some 
retention interval has elapsed. 

We can devise PI and RI paradigms that correspond to every point on the 
Osgood surface. For example, we can develop RI and PI paradigms that
correspond to the A-B, C-D; A-B, C-B; and A-B, A'-B paradigms. Osgood (1949) 
developed his surface with the idea that it would predict retroactive-inhibition 
effects as well as transfer effects. It is, in fact, called the transfer and retroaction 
surface. Thus, the A—B, A—D paradigm is expected to yield both high negative 
transfer and strong retroactive inhibition (which it does). Similarly, the A—B, 
A'—D paradigm is expected to yield moderate negative transfer and moderate 
retroactive inhibition, and so on. 
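
As a memory aid, the qualitative predictions discussed here can be collected into a small lookup table. This is only a sketch: the dictionary, the function name, and the labels are hypothetical, and the entries are a coarse reading of the surface as summarized in this text (the "moderate positive" entry in particular is an interpolation between zero and maximum positive), not Osgood's own quantitative values.

```python
# Keys: (stimulus relation, response relation) between the first and second list.
# Values: direction and rough size of transfer predicted by the surface.
OSGOOD_PREDICTIONS = {
    ("identical", "identical"): "maximum positive",   # A-B, A-B
    ("similar", "identical"): "moderate positive",    # A-B, A'-B (interpolated)
    ("unrelated", "identical"): "zero",               # A-B, C-B
    ("unrelated", "unrelated"): "zero",               # A-B, C-D
    ("similar", "unrelated"): "moderate negative",    # A-B, A'-D
    ("identical", "unrelated"): "high negative",      # A-B, A-D
}


def predicted_transfer(stimulus_relation, response_relation):
    """Return the qualitative transfer prediction for a two-list paradigm."""
    key = (stimulus_relation, response_relation)
    if key not in OSGOOD_PREDICTIONS:
        raise KeyError(f"no prediction recorded for {key}")
    return OSGOOD_PREDICTIONS[key]


print(predicted_transfer("identical", "unrelated"))  # high negative
```

Because the surface is meant to cover retroaction as well as transfer, a negative entry is read as a prediction of retroactive inhibition of roughly the same size.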

The predictions made from the surface with respect to RI have received some
limited support (Bugelski & Cadwallader, 1956; Gibson, 1941; Hamilton, 1943; 
Kanungo, 1967). But, as is true of the transfer predictions of the surface, the RI 
predictions have proved to be less than perfect. The student should thus consider 
the surface not as a perfect predictor but as a convenient mnemonic device. When 
one becomes lost in the maze of RI and PI paradigms, the Osgood surface 
provides a handy mechanism for sorting them all out and for perceiving their 
relationships to one another. 

THE GENERALITY OF INTERFERENCE 

Before we move to a consideration of interference theory, one final point should 
be made. Interference effects can be quite general and operate in the world outside 
the laboratory. Even though most of the research seems to be cast in
paired-associate, serial, and free-recall situations, we should keep in mind that
interference effects are pervasive and are not limited to these somewhat artificial tasks. 

For example, interference effects can be observed in what psychologists call 
connected discourse (see Slamecka, 1960, 1961). Connected discourse includes 
such things as meaningful sentences and paragraphs. Interference effects can also 
be obtained using two languages (Lopez, Hicks, & Young, 1974). That is, if one 
task is in one language and the second task is in a different language, interference 
effects can be observed. RI and PI can show up in classical conditioning (Wickens, 
Tuber, & Wickens, 1983), in priming experiments (Neely, Schmidt, & Roediger, 
1983), in eyewitness situations (Bekerian & Bowers, 1983), and, as we shall see, 
in many animal activities (Roitblat & Scopatz, 1983). Interference effects can be 
obtained with visual stimuli as well as the verbal stimuli that we have been 
emphasizing (Mendell, 1977) and in free-recall situations (Roediger, Stellon, & 
Tulving, 1977). The point is that, even though the interference interpretation 
“grew up” in a “paired-associate neighborhood,” its principles and rules seem to 
guide behavior in many different situations. 

THE TWO-FACTOR THEORY OF FORGETTING

Thus far we have been speaking of nothing more than the facts. Retroactive and
proactive inhibition do occur, and they are powerful. They are strongly affected
by such factors as intertask similarity. We now turn to the problem of trying to 
explain these facts. Why do PI and RI occur at all? What factors or mechanisms 
contribute to their particular forms? How much do we know, and how much do 
we not know, about the processes that underlie and cause these undeniably 
powerful interference effects? We now step into the world of theory. 

An Overview 

For many years the two-factor theory of forgetting (Melton & Irwin, 1940) stood
as the uncontested framework within which psychologists attempted to
understand forgetting. Although the two-factor formulation has come into sharp
conflict with more recent conceptions of memory, it still stands as one of the major
explanatory developments within the field. This section outlines the basic
elements of the system and some of the recent challenges to it. 

The original formulation of the theory proposes that a given retention loss, or
interference effect, can be determined by at least two important factors,
competition and unlearning. Let us first consider competition. Suppose that we have just
learned an A-B, A-D sequence. We then attempt to recall the B responses when
the A stimuli are presented. We find that we cannot recall all the correct B
responses. Some portion of this retention loss may result from response
competition at the time of recall. When A is presented during the recall phase and we are 
asked for B, our ability to produce B will sometimes be blocked by D. We 
mistakenly recall D instead of B. Incompatible responses have been connected to 
the same stimulus. The recall of one temporarily blocks recall of the other. The 
incompatible responses compete with each other, cause confusion, and lead to 
errors in recall. The concept of response competition will be elaborated on 
shortly. For now, realize that, if we attempt to recall the first-list responses, 
unwanted recall of the second-list responses may disrupt first-list recall (this 
would be an RI effect). Similarly, if we attempt to recall the second-list responses, 
unwanted recall of the first-list items may interfere with recall (this is a PI effect). 

The second of the two interference factors is commonly known as unlearning, 
which is, perhaps, not the best choice of labels, but it will have to do. The idea 
behind the concept of unlearning is that the acquisition of new materials (for 
example, the A—D list) can bring about losses of the old materials (the A—B list). 
As subjects learn the A—D list, there is a concomitant loss in the strength of the 
first-list associations. During the learning of the A—D list, the A—B materials are 
somehow unlearned, weakened, or made less available. First-list associations are 
lost during second-list learning. Exactly how this unlearning occurs will be the 
topic of an upcoming section. 

For now, several points should be kept in mind. First, either or both of these 
factors (unlearning and competition) can contribute to an observed interference 
effect. Second, competition is presumed to be operating at the time of recall, 
whereas unlearning operates during second-list learning. The two factors have 
their impact at different points within the experimental sequence. The third and 
final point is that response competition is presumed to contribute to both PI and 
RI, whereas unlearning contributes to RI alone. Let us first take up the notion 
that competition contributes to both RI and PI. Unwanted recall of the second list
can compete with recall of the first list (RI). Similarly, unwanted recall of the first
list can compete with attempts to recall the second list (PI). Now let us turn to the
notion that unlearning contributes to RI but not PI. As we have noted, unlearning 
refers to the fact that first-list associations are somehow made unavailable during 
second-list learning. Hence, first-list retention is clearly affected by unlearning 
(RI). But second-list, or A-D, materials do not undergo unlearning in our 
experiment. Hence, losses in second-list retention, or PI, are often presumed to be 
totally determined by competition at the time of recall, whereas RI is presumed to 
be determined by both competition and unlearning. 

The Separation of Competition and Unlearning 

In view of the fact that both unlearning and competition can affect retention, how 
might we go about separating the effects of these two factors in an experimental 
fashion? For years no one knew quite how to do it. For example, RI experiments 
would be run in which the subjects would be given the usual 2 seconds to recall 
each of the B responses following A—B and A—D learning. A—B losses would be 
observed, but no one knew whether they resulted from competition at the time of 


CHAPTER 8/MEMORY MODELS I: INTERFERENCE, DECAY, AND CONSOLIDATION 229 





recall or from unlearning or both. Attempts were finally made to eliminate the 
effects of competition from these experiments. When this was done, whatever 
retention loss was left could be safely attributed to the unlearning factor.
Competition effects were eliminated in a very simple manner. The subjects were asked 
to recall both the B and D responses and were given unlimited time to do so. The 
idea was that, if the subjects were not pressed for time and were free to spend as 
much time as they wished in searching for, and sorting out, the B and D responses, 
then the competition effect would be minimized. The obvious assumption here is 
that competition effects are dependent on time pressure. If we remove the
requirement of recalling the critical item within a few seconds, then the confusion,
blocking, and momentary competition between B and D responses are eliminated. 
This line of reasoning led to the development of what is commonly called the 
MMFR (modified, modified free-recall) test (Barnes & Underwood, 1959). In this 
type of recall test the subjects are not pressed for time at all and are asked to recall 
the B and D responses in any order in which they come to mind. When this 
unpaced recall task is used, any remaining retention losses can be attributed to the 
unlearning factor alone and are presumed to be free from the effect of the 
momentary competition factor. 

As we have seen, classical two-factor theory states that PI is determined 
completely by response competition at the time of recall. It has also been
commonly assumed that the MMFR task effectively eliminates the effects of
competition. Hence, it follows that no PI should appear when the MMFR task is
employed. After learning an A—B, A—D sequence, subjects should have no trouble 
recalling the D responses if they are given plenty of time to do so and are not 
subjected to the confusion produced by paced or limited recall time. This seems to 
be a reasonable prediction. Unfortunately, PI effects have been consistently 
observed with the MMFR task (Ceraso & Henderson, 1965; Postman, Stark, & 
Fraser, 1968). It might be that the MMFR task does not effectively eliminate 
competition effects after all. Or PI might result from something beyond the 
competition factor. Whatever the final outcome, this contradiction, or unpredicted 
result, typifies the problems facing the development and refinement of the two- 
factor theory. The interested reader is referred to Anderson (1983), Bowles and 
Glanzer (1983), Postman, Stark, and Burns (1974), Postman and Keppel (1977), 
and Underwood (1983) for recent conceptions of the nature of PI. 

Unlearning as Extinction 

Is human forgetting anything like animal “forgetting”? Does human
“unlearning” resemble an extinctionlike process? We have already discussed the
phenomenon of experimental extinction, which has its origins in the animal
literature. Animals “forget,” or lose responses, through a process of extinction. The
rat, after learning to press a lever for food, will stop the pressing response if food, 
or reinforcement, no longer follows the occurrence of that response. If the animal 
does not undergo this active unreinforced responding, the response will be
maintained for long periods. Thus, for the rat to stop its response, it must undergo an
active process in which the previously reinforced response occurs repeatedly 
without reinforcement. 

Do humans lose such things as verbal responses in a similar manner? Do we 
forget because we undergo extinction? On an intuitive level it does not seem 
likely. When we forget someone’s name, the title of a book, or a section of a text,
it does not seem as though we have undergone extinction. Where and when did 
we make the response and suffer a state of nonreinforcement? 

Interestingly, although the mechanism is not obvious, some human forgetting 
has been thought of as analogous to experimental extinction (Keppel, 1968; 
Postman, 1971; Postman & Underwood, 1973). Specifically, the unlearning factor 
just outlined has been conceived of as an instance of experimental extinction. The 
analogy is as follows. Consider the A—B, A—D situation. During A—D learning, 
the subjects sometimes mistakenly respond with B rather than D. The subjects 
have just learned the A—B list, and it is “on their mind.” These errors may be 
either overt or covert. Each time the B response occurs as an error, it is not 
reinforced. (The computer or the experimenter does not confirm the B response as 
correct.) The occurrence of the B response during A—D learning constitutes an 
instance of unreinforced responding, or experimental extinction of the A—B 
association. The greater the number of overt or covert B responses occurring 
during A—D learning, the greater is the extinction or unlearning of the A—B 
associations. This is an interesting idea. It represents an important point of 
contact between the animal and human literature and suggests that at least some 
animal extinction and human forgetting may be comparable. 

If the unlearning factor is truly analogous to experimental extinction, then 
human forgetting should display the important properties of the extinction
phenomenon. Spontaneous recovery is a case in point. If you will recall, a response
that has been extinguished will, with rest, recover some of its strength. For
example, suppose that a rat’s bar-pressing response has been completely
extinguished and that the animal has been removed from the Skinner box for, say, 24
hours. We find that, when the animal is returned to the box after the rest interval, 
the response has spontaneously recovered some of its strength. 

If the analogy between unlearning and extinction is to be maintained, then a 
similar sort of recovery phenomenon must occur with verbal materials. If an A—B 
list is extinguished during A—D learning, then, as time passes, the A—B list must 
show some recovery. The unlearning or extinction must be at least partially 
reversible. Although the results have not been completely consistent, such a 
recovery phenomenon has been demonstrated (Forrester, 1970; Martin &
Mackay, 1970; Shulman & Martin, 1970). It is not an easy effect to obtain, but the
fact that it has been observed lends some support to the extinction interpretation 
of unlearning. 


Competition: Specific Versus Generalized 
In our introductory discussion of response competition we noted that retrieval of 
one response will sometimes be blocked by the unwanted recall of the other 
response that has been associated with the same stimulus. This effect is referred to 
as specific response competition. The recall of one specific response blocks the 
recall of another specific response. But there is another type of competition that 
leads to disruption of the retention process. It is referred to as response-set 
interference and has been elaborated by Postman, Stark, and Fraser (1968). (The 
ideas behind the response-set interference hypothesis were originally put forward 
by Newton and Wickens in 1956.) The response-set interference analysis goes like 
this: While subjects are learning a first list, they restrict their responses to those 
that are appropriate to that list. They inhibit the occurrence of incorrect
responses, or responses that do not belong to the set of correct first-list responses.
Then, when the subjects are shifted over to a second list, they shift over to the set 
of correct second-list responses. They inhibit the occurrence of the entire set of 
first-list responses. They concentrate or focus on the set of correct second-list 
responses. But the subjects cannot shift from one set of responses to another 
instantaneously. As Postman and Underwood (1973) put it, the tendency to give 
responses from a given list is possessed of a certain amount of inertia. There will 
be a strong tendency for the subjects to continue to give responses from the most 
recently learned list. It takes a while for the subjects to shake off this tendency, or 
to change their orientation. Hence, after the subjects have completed second-list 
learning, the tendency to give second-list responses persists for a while. 

Now suppose that we ask the subjects for the first-list responses immediately 
following the completion of second-list learning. Presumably, the subjects will be 
plagued by the persisting tendency to give responses from the more recently 
learned list. These second-list responses will compete with, and block, retrieval of 
the first-list responses. Thus, at least part of the subjects’ inability to come up 
with the first-list responses in an RI situation can be attributed to response-set 
interference. 

There have been many attempts to test the response-set interference hypothesis. 
The formulation has led to a good number of testable hypotheses and has received 
considerable support in the literature. The interested reader is referred to Cofer, 
Failie, and Horton (1971), Lehr, Frank, and Mattison (1972), and Postman and 
Stark (1969). 

One point should be kept firmly in mind. The notion of response-set
interference is not intended to replace the other interference mechanisms (that is,
unlearning and specific response competition). It is intended to supplement them. In
any given situation any of these interference mechanisms may contribute to 
overall retention losses. It is the task of the proponents of the two-factor position 
to clearly delineate when, and to what extent, each of these potential sources of 
interference determines overall forgetting. 

CHALLENGES TO UNLEARNING 

Independent Retrieval 

The two-factor theory is undergoing revision and refinement. Despite what its 
detractors say, it does have explanatory power, and it has generated an enormous 
amount of research. This is not to say that the theory is without faults. It is not 
entirely comprehensive, nor can it adequately account for all known interference 
effects (for example, PI obtained with the MMFR test). We begin our exploration 
of alternative conceptions of memory by examining some of the recent challenges 
to the classical two-factor interpretation of interference effects. Then, in
successive sections we shall discuss alternative models of memory and their adequacy in
relation to the two-factor position. 

According to classical two-factor theory, A—B associations are unlearned
during A—D learning. Martin (1971) and a number of other investigators have
suggested that there is evidence that contradicts this unlearning principle. They reason
that, if learning A—D entails unlearning A—B, then the recall of B should be less
likely when D is recalled than when D is not recalled. In other words, the recall
of B and D should be dependent on each other. As the strength of A—D is built up,
the strength of A—B should decrease. Thus, if we test for the recall of both 
responses, the subjects should be less likely to recall B when D is recalled. The 
probabilities of recalling B and D should be inversely related. 

A number of investigators have obtained results that appear to contradict this 
expected inverse relationship between B and D recall (Greeno, James, &
DaPolito, 1971; Martin, 1971; Martin & Greeno, 1972; Wichawut & Martin, 1971). In
general, they found that B and D recall are independent of each other. That is, the 
recall of B is just as likely given that D is recalled as it is given that D is not 
recalled. This effect has been labeled the independent retrieval phenomenon. 
Martin (1971) suggests that it deflates the concept of unlearning, because the 
unlearning principle requires, or implies, an inverse relationship between B and D 
recall. Martin suggests that we must look elsewhere for an explanation of RI 
effects. 
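
The logic of this comparison can be made concrete with a small worked example. What follows is only a minimal sketch in Python, and it rests on assumptions: the function name and all the counts are hypothetical and are not taken from any of the studies cited above. Each stimulus is sorted into one of four cells according to whether B was recalled and whether D was recalled. The sketch then compares the probability of recalling B when D is recalled with the probability of recalling B when D is not recalled.

```python
def conditional_b_recall(both, b_only, d_only, neither):
    """Return (P(B | D recalled), P(B | D not recalled)) from 2 x 2 counts.

    both    -- stimuli for which B and D were both recalled
    b_only  -- stimuli for which B was recalled but D was not
    d_only  -- stimuli for which D was recalled but B was not
    neither -- stimuli for which neither response was recalled
    """
    d_recalled = both + d_only
    d_not_recalled = b_only + neither
    if d_recalled == 0 or d_not_recalled == 0:
        raise ValueError("each column of the table needs at least one observation")
    return both / d_recalled, b_only / d_not_recalled


# Hypothetical counts illustrating independent retrieval:
# B is recalled equally often whether or not D is recalled.
ind_given_d, ind_given_not_d = conditional_b_recall(
    both=30, b_only=20, d_only=30, neither=20
)
print(ind_given_d, ind_given_not_d)

# Hypothetical counts illustrating the inverse relationship that
# Martin takes the unlearning principle to predict:
# B is recalled less often when D is recalled.
inv_given_d, inv_given_not_d = conditional_b_recall(
    both=15, b_only=35, d_only=45, neither=5
)
print(inv_given_d, inv_given_not_d)
```

In the first hypothetical table the two conditional probabilities are equal, which is the pattern labeled independent retrieval. In the second, recall of B is much less likely when D is recalled, which is the pattern Martin argues the unlearning principle requires.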

The validity and relevance of the independent retrieval phenomenon are not
universally accepted (see Hintzman, 1972). Postman and Underwood (1973) have 
attacked the very logic put forward by Martin (1971). They suggest that Martin is 
wrong in assuming that the unlearning principle demands an inverse relationship 
between B and D recall. They argue that it is not the actual acquiring of the D 
response that is critical in the unlearning of the B response. As we have seen, it is 
the occurrence of B as an unreinforced response during A—D learning that is the 
critical factor in the unlearning of the A—B association. Second-list, or A—D, 
learning merely provides the opportunity for the B response to occur as an error. 
There is nothing in two-factor theory that states that it is the learning of the A—D 
association that actually causes the A—B loss or that the A—B associations must 
be unlearned before A—D associations can be learned. Thus, two-factor theory, 
according to Postman and Underwood (1973), does not demand an inverse 
relationship between D and B recall. 

Encoding Variability 

In any case, whether or not the independent retrieval phenomenon actually 
invalidates the unlearning principle, Martin (1971, 1972) has proposed an 
alternative interpretation of interference effects. His formulation is intended to 
replace, and not merely supplement, the classical two-factor interpretation. It is 
based on his concept of stimulus encoding variability. If you will recall, a given 
stimulus is conceived of as possessing many different attributes, or components. 
During A—B learning the B response is not connected to the entire A stimulus but 
rather to some selected portion of the overall stimulus. Then, during A—D 
learning the D response is not connected to this same set of attributes. It is 
connected to a different, or at least partially different, set of stimulus attributes. 
Thus, D and B are connected to different functional stimuli even though the 
nominal stimulus is identical in A—B and A—D learning. For example, if the A 
stimulus is ZQKO, then the B response might be connected to Z, whereas D 
might be connected to Q. Granting that this is true, how does RI occur, and why 
does the subject have trouble recalling the B responses when presented the total A 
stimulus after A—B and A—D have been learned? According to Martin, it happens
because of a persisting tendency to sample, select, or attend to those stimulus 
attributes utilized during A—D learning. Following A—D learning, the subjects 
continue to “pay attention to” those stimulus attributes that they have most
recently used in hooking up responses. Hence, because of their continuing
attention to these most recently utilized attributes, the D response is cued off rather
than the B response. 

Let us consider a very simplified example. Suppose that a subject is required to 
connect dog to norop during A—B learning. Imagine that this subject once owned 
a dog named Norton. Hence, she might easily master this particular item by 
associating dog with nor and might ignore the remaining letters in the nominal 
stimulus (op). Now, during A—D learning the subject might be required to
connect eye to the stimulus norop. The letters op might remind the subject of
optical. Thus, she might find it convenient and simple to associate eye with op
and ignore the remaining letters (nor). Thus, dog is connected to nor, and eye is
connected to op. Then, during the retention test following A—D learning the
subject is asked to recall dog given norop as a cue. Martin (1971) argues that the 
subject will continue to attend to, or sample, the stimulus attributes most recently 
utilized. She will focus on, or sample, op. Op leads to eye rather than dog. Hence, 
we have a disruption in the subject's ability to recall the first-list response, or an 
RI effect. 

Martin’s explanation of interference sounds a bit like the Postman, Stark, and 
Fraser (1968) notion of response-set interference. In both analyses subjects cannot 
recall the first list because they are still bound up with the second list. The 
involvement with the second list persists after second-list learning and, in some 
sense, blocks out recall of the first list. The two analyses do differ in terms of their 
emphasis. On the one hand, Martin focuses on a persisting tendency to attend to 
the set of stimulus attributes utilized during second-list learning. On the other 
hand, Postman, Stark, and Fraser (1968) emphasize a persisting tendency to give 
second-list responses. Martin focuses on stimulus persistence, whereas Postman 
focuses on response persistence. It would seem reasonable to suspect that both 
might be right. The tendency to continue giving recent responses and to continue 
attending to recently utilized stimulus components might both contribute to RI. 
Martin’s analysis might best be taken as a supplement to, rather than as a 
replacement for, the more traditional interference mechanisms. Martin’s
stimulus-encoding-variability hypothesis has received considerable attention in the
literature (for example, Ellis, 1973; Goggin & Martin, 1970; Martin, 1972; Mueller,
Gautt, & Evans, 1974; Williams & Underwood, 1970). 

Accessibility Versus Unavailability 

Martin’s (1971) independent retrieval phenomenon and his encoding variability 
interpretation of the so-called unlearning effect are not the only factors being 
marshaled against the traditional interpretation of interference effects. A number 
of authors (see Nelson & Brooks, 1974; Reynolds, 1977; Tulving & Psotka, 
1971) begin their attack by distinguishing between item availability and item 
accessibility. If an item is stored in our memory, on the one hand, it is available. If 
it is unavailable, then it is permanently gone from our memory or was never there. 
Accessibility, on the other hand, refers to our ability to pull available items out of 
our memory store, or to retrieve them. An item can be available (stored) without 
being accessible (retrievable). For example, you probably have the name of the 
president whose picture is on a $20 bill stored, or available, but, for any number
of reasons, it might be temporarily inaccessible (you cannot recall it at the
moment). If an item is unavailable, it can never be accessible, whereas an
available item can sometimes be accessible and sometimes not.

The argument here is about whether unlearning is the same as unavailability. 
According to interference theory, if an item is unlearned, it becomes permanently 
unavailable, and if it is unavailable, it should also be totally inaccessible. But if 
an item that has presumably been unlearned can be retrieved, or is accessible, then 
it must not have been truly unavailable. 

The following sort of experiment has been conducted to test these ideas. 
Subjects learn six successive tasks and are then tested for retention of the first-task 
responses. It is usually noted that retention of the first-list material is poor. This is 
an RI effect, and the two-factor theory assumes that the first-list materials could 
not be recalled because they were unlearned during List 2—6 learning and are thus 
unavailable. But now something a little different is done. The subjects are tested 
for first-task recall again. This time they are given some hints, or clues. For 
example, they might be given the names of categories to which the thus far 
unrecalled first-task items belong. If the unlearning hypothesis is correct and the 
items are truly unavailable, this kind of hint should not help the subjects recall 
more first-task items. But it does. Subjects can recall items during the second test 
of first-task recall that they did not remember during the first test. The conclusion 
that some draw from this is that the items were not really unavailable, or 
unlearned, after all; they were merely temporarily inaccessible. The conclusion is 
that RI does not result from unlearning (unavailability) but rather from
inaccessibility. The items are presumed to be “in there” somewhere and not totally lost
through unlearning. 
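
The reasoning behind the two-test procedure can be sketched in a few lines of Python. This is an illustration only, built on assumptions: the word list, the recall results, and the function name are all hypothetical and do not come from any of the experiments cited. The sketch simply picks out the items that were missed on the first, uncued test but recalled on the second, cued test.

```python
def newly_recovered(first_test, cued_test):
    """Items recalled on the cued second test that were missed on the first test."""
    return set(cued_test) - set(first_test)


# Hypothetical first-task list and hypothetical recall results.
first_task_items = {"robin", "sparrow", "oak", "maple", "hammer", "saw"}
uncued_recall = {"robin", "hammer"}
cued_recall = {"robin", "sparrow", "oak", "hammer", "saw"}

# These items must have been available all along (stored),
# even though they were inaccessible on the first test.
recovered = newly_recovered(uncued_recall, cued_recall)
print(sorted(recovered))

# Items missed on both tests cannot be classified from these data alone:
# they may be unavailable, or merely inaccessible with the cues provided.
still_missing = first_task_items - cued_recall
print(sorted(still_missing))
```

Any item in the recovered set counts against the claim that it had become unavailable. Note that the argument runs in one direction only. An item that stays missing on the cued test has not thereby been shown to be unlearned.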

This kind of experiment convinces some people that the unlearning principle is 
incorrect and that RI is a matter of item inaccessibility rather than permanent 
unavailability. But it will probably take quite a few more clever experiments 
before the venerable concept of unlearning is finally laid to rest (see Brosgole & 
Grosso, 1983). 


DECAY THEORY 

Perhaps the oldest, and by far the simplest, interpretation of memory is called 
decay theory, or sometimes disuse theory. (Decay theory is the second of the six 
interpretations of memory we shall discuss, interference theory having been the 
first.) Quite simply, decay theory suggests that associations somehow grow
weaker, or fade, with the passage of time. This possibility has intuitive appeal, because
our memories do seem to fade with disconcerting regularity. The decay notion 
suggests that forgetting is a simple, inexorable function of time, as though our 
memories grow weaker as they grow older. And yet the decay hypothesis has 
proved to be relatively unfruitful. It has not led to many testable hypotheses. The 
reason may be that it is impossible to eliminate all sources of interference from a 
given experimental situation. For example, suppose that we wish to demonstrate 
that the memory traces of a learned list decay with time. Accordingly, we have our 
subjects learn a list and then test for retention after a designated interval. To 
ensure that any loss we observe results from decay, and not RI, we must ensure 
that the subjects learn nothing during the retention interval. If they learn anything 
at all, then the loss could result from RI. It is difficult to “turn subjects off” during 
a time interval. We could put them to sleep, but such a procedure does not
eliminate the possibility that some kind of learning activity will occur while they 
are dropping off to sleep, while they are actively dreaming, or while they are 
awakening. (Interestingly, studies involving such sleep procedures seem to
support an interference, rather than a decay, interpretation of forgetting [Ekstrand,
1967].) And even if we were sure that the subjects learned nothing at all during 
the retention interval, we still could not attribute a retention loss to decay, 
because of possible PI effects. All the subjects’ preexperimental experiences
constitute potential sources of proactive interference. (We have to assume that they
have learned something before they enter the experimental situation.) In short, 
the definitive decay experiment is, as far as we know, impossible. To run a good 
decay experiment we would have to round up subjects before they had learned 
any potentially interfering material at all. Then, after they had learned the critical 
materials, we would have to “turn them off” immediately and completely until 
the time of the retention test. Thus far, no one has been able to establish such a 
pattern, although attempts to demonstrate decay continue (Reitman, 1974; 
Roediger, Knight, & Kantowitz, 1977).

This is not to say that decay does not operate or contribute to observed 
retention losses. Interference theoreticians are very aware that some portion of 
our retention losses may result from decay. They are merely pointing out that it is 
currently impossible to isolate the decay effect, or to separate it from interference 
effects. It is like a constant added to all our interference effects. But we can create,
predict, and observe enormous variations in retention owing to obvious
interference factors (for example, by varying the amount, or type, of interfering activity).
Hence, much more emphasis has been placed on interference effects simply 
because (1) they are extensive and (2) we can relate them to changes in our 
independent variables. 

Another interesting fact about the decay idea is that, even though it is extremely 
controversial and has not been demonstrated to everyone’s satisfaction, it has 
been routinely incorporated as a subprocess into many recent memory models. As 
we shall see in Chapter 9, theoreticians have unhesitatingly assumed that some 
kinds of stored information decay with the passage of time, even though
experimental verification of this fact is lacking.

CONSOLIDATION 

The third of our six approaches to the understanding of memory is called the 
consolidation hypothesis. 


Retrograde Amnesia 

We begin our discussion of consolidation by first looking at an intriguing
experimental finding called the retrograde amnesia effect. In a retrograde amnesia
study an animal first learns a particular response (an avoidance response, for 
example). After acquisition of the response the animal is subjected to an amnesic 
agent, such as electroconvulsive shock, puromycin, or potassium chloride. The 
animal is then tested for the original response. The animal’s memory for the 
recently learned behavior seems to be impaired. The term retrograde refers to the 
fact that the amnesic effect appears to increase as the time between the learning 
and the introduction of the amnesic agent is decreased. In other words, recent 
memories suffer the most. If you shock the animal immediately, the learning is less
likely to become permanent than if you allow time to pass before administering 
the amnesic agent (Barraco & Stettner, 1976; Kinsbourne & Wood, 1975; 
McGaugh & Gold, 1976; Milner, 1970).

Parenthetically, the retrograde amnesia procedures are reminiscent of the
electroconvulsive shock treatments sometimes used with human patients, particularly
those found to be deeply depressed. The general procedure is one in which the 
patient is first given a muscle relaxant (to prevent damage in the upcoming, 
electrically induced convulsion). Electric current is then passed through the brain 
for a brief moment by way of electrodes attached to the patient’s temples. Two of 
the major results of this type of treatment are that (1) patients come out of their 
depressed state, at least temporarily, and (2) they suffer from some degree of 
amnesia, with more recent memories suffering more than older memories. 

The Consolidation Hypothesis 

Given that retrograde amnesia is a phenomenon that can easily be demonstrated 
with animals (see Spear, 1973) and, for that matter, with humans (Landauer, 
1974; Tulving, 1969), what accounts for the effect? 

Early interpretations (McGaugh, 1966; McGaugh & Dawson, 1971)
emphasized the concept of consolidation. The notion here is that the amnesic agent
somehow disrupts consolidation of the memory trace. For a memory trace to
become permanent, it needs a little time to consolidate. Consolidation
interpretations argue that factors such as electroconvulsive shock disrupt and prevent the
transfer of information into permanent store. The explanation is that the amnesic 
agents render the information unavailable on future tests (see also Hebb, 1949). 
Think of the stored information as cement. For it to become permanent it must 
have time to harden. If someone stirs it about before it hardens, its eventual form 
may be nothing like the form in which it was originally poured. 

Reversible Amnesia 

The original consolidation hypothesis centered on the idea that amnesic agents 
such as electroconvulsive shock caused permanent memory losses. Shock was 
supposed to completely block the “laying down,” or storage, of memory traces. 
But a number of experiments (see Quartermain & Botwinick, 1975) have shown 
that retrograde amnesia is not permanent. Information that has been “lost” 
through the introduction of amnesic agents can be regained, or “reactivated.” 
There are several techniques for observing the reversibility of amnesia. They 
include (1) the introduction of appropriate “reminders,” or prompts, and (2) the 
observance of “spontaneous” recovery of the lost information (Gerson &
Henderson, 1978; Lewis, 1979; Miller & Springer, 1972).

The reversibility of amnesia suggests that the information is available all along 
(stored) but is not accessible (cannot be retrieved) until appropriate reminders, or 
other critical conditions, are present. In other words, electroconvulsive shock 
does not prevent storage of information. It merely disrupts the subject’s ability to 
utilize or retrieve whatever information is stored. Suppose that you are listening 
to an old-time radio program. The heroine suffers a head injury in an automobile 
accident. She cannot remember a thing. Her family and friends are upset, to say 
the least. She contemplates ending it all. But with the careful help and prompting 
of her family physician it all begins to come back. She “regains her memory”
and marries Mark after all. The point here is that she never really lost anything. 
The information was still in storage, but she could not pull it out, or retrieve it, 
until prompted or reminded. The information was available but temporarily 
inaccessible. 

This view, emphasizing the disruption of retrieval processes, has received 
considerable support in the literature (Miller, Ott, Berk, & Springer, 1974; Miller 
& Springer, 1973, 1974; Spear, 1971, 1973). At the same time, proponents of the 
storage position have not disappeared (Gold & King, 1974; McGaugh &
Dawson, 1971). A fine healthy controversy exists, with interesting studies appearing in
the current literature. 

In summary, we have looked at three approaches to the understanding of 
memory in this chapter—interference theory, decay, and consolidation. Each has 
something to offer, and at present none can be discarded out of hand. In Chapter 
9 we discuss three additional approaches to the study of memory—separate-store 
models, levels of processing, and semantic-network models. It is probably fair to 
say that the following chapter covers more modern interpretations than those 
discussed in this chapter; for the most part these newer models are described as 
being in the information processing tradition. 


SUMMARY 

1. Proactive inhibition (PI) refers to losses in our ability to remember a given set 
of materials that can be attributed to the interfering effects of previously 
learned materials. 

2. Retroactive inhibition (RI) refers to memory losses that can be attributed to 
the interfering effects of subsequently learned materials. 

3. Intertask similarity has been recognized as important in determining RI and 
PI effects. Osgood’s transfer and retroaction surface gives us a rough idea of the 
kinds of RI effect expected under conditions of varying intertask similarity. 

4. Interference effects are not limited to arbitrary tasks such as paired-associate 
learning. For example, they appear with connected discourse, with varying 
languages, and when animals are employed as subjects. 

5. The classical two-factor theory of forgetting proposes that interference 
effects are determined by competition at the time of recall and by the 
unlearning of first-list associations that occurs during second-list learning. 

6. Specific competition is presumed to contribute to both RI and PI, whereas 
unlearning affects RI but not PI. 

7. Unlearning effects are presumed to be separated from competition effects 
through the use of unpaced retention tests. If the subjects are given as much 
time as they wish to attempt retrieval, their performance will not be affected 
by competition. This type of unpaced retention test has been labeled MMFR. 

8. If the MMFR task eliminates competition and PI is determined completely by
competition, then no PI should appear when the MMFR test is used. This 
straightforward prediction has not been confirmed. Proactive inhibition does 
occur with MMFR. This could mean that the theory needs revision or that 
MMFR does not eliminate competition.

9. Unlearning has been thought of as an extinctionlike process. During A—D
learning the subjects sometimes make B responses, either overtly or covertly.
The B responses, which are occurring as errors during A—D learning, are
unreinforced (unconfirmed), and are thus thought to be analogous to
responses undergoing experimental extinction. This formulation provides a
point of contact between human verbal learning and the basic animal
literature, wherein the concept of experimental extinction developed.

10. If the extinction interpretation of unlearning is correct, then unlearned 
first-list associations should recover some of their strength with rest. In 
support of this interpretation spontaneous recovery of first-list associations 
has been demonstrated. 

11. There are two types of response competition. Specific response competition 
refers to the recall of one specific response blocking the recall of another 
specific response. Response-set interference refers to the blocking of one set 
of responses (for example, the entire set of first-list responses) by the
tendency to continue to give responses from the most recently learned list.

12. Response-set interference is intended as a supplement to, rather than as a 
replacement for, the other interference mechanisms (that is, specific competition 
and unlearning). 

13. The independent retrieval phenomenon refers to the fact that recall of B is 
just as likely given that D is recalled as it is given that D is not recalled. It has 
been suggested that this phenomenon invalidates the unlearning principle. A 
controversy surrounds the logic of this attack on the two-factor theory. 

14. In any case, alternative interpretations of interference effects have been 
proposed. For example, the encoding variability interpretation suggests that 
subjects connect B and D to different attributes of A in an A—B, A—D 
situation. Then, when tested for B given A as a cue, the subjects display an RI 
effect because of a tendency to continue to select, or attend to, those attributes 
of A that were attended to during the more recent learning task (A—D). 

15. The encoding variability interpretation and the response-set interference 
hypothesis seem similar in that both suggest that RI results from a persisting 
involvement with the second task. The former emphasizes a persisting 
tendency to attend to certain stimulus attributes, whereas the latter emphasizes 
a persisting tendency to give second-list responses. Both tendencies may 
contribute to overall RI effects. One interpretation need not invalidate the 
other. 

16. The unlearning principle has also been attacked by those who feel RI can be 
explained in terms of item inaccessibility rather than permanent unavailability. 

17. Evidence for this idea comes from studies showing that a cued, or prompted, 
second test can lead to recall of items not recovered during a first test. 

18. The decay hypothesis, or the notion that memory traces somehow fade with 
the passage of time, has been proposed as an alternative to the interference 
mechanisms. The decay hypothesis has proved to be relatively unfruitful, for 
it is currently impossible to eliminate all sources of interference within a 
memory experiment. 

19. In retrograde amnesia studies, animals first learn a particular response. Then, 
after the administration of some amnesic agent, such as electroconvulsive 
shock, it is found that the response has been lost. This effect appears to be 
similar to the memory loss that follows electroconvulsive shock treatment used 
with depressed human patients. 

20. Retrograde amnesia has been interpreted as disruption of the encoding and 
storage processes (the consolidation hypothesis). 

21. However, the discovery of the reversibility of retrograde amnesia has prompted 
many to contend that the effect involves a disruption of retrieval processes 
rather than encoding or storage processes. 

THE COMPONENTS OF MEMORY: ENCODING, STORAGE, AND RETRIEVAL 

Before 1960, the two-factor theory of interference pretty much dominated the 
field of forgetting and retention. With its foundations in Hullian-type S-R 
psychology and its emphasis on such basic experimental phenomena as extinction 
and spontaneous recovery, it stood as the major, relatively uncontested, conception 
of forgetting. But the last 25 years have seen a spectacular explosion in 
thinking and theorizing about human memory processes. In contrast to the early 
years, we now have many models of memory, with more appearing all the time. 
Although the interference, decay, and consolidation approaches discussed in the 
last chapter are still influential, they appear to be undergoing replacement by the 
newer approaches described in this chapter. 

Our purpose now is to examine the most important of these newer models, 
including separate-store models developed during the 1960s, the levels-of- 
processing approach appearing in the early 1970s, and semantic-network models 
currently enjoying a wave of popularity. 

But before we move to consider specific memory models, we should spend a 
moment with the three aspects of memory: encoding, storage, and retrieval. The 
distinctions among these three elements, or components, of the overall memory 
process seem to permeate most models of memory. In fact, researchers loyal to the 
older interference approach to memory use these labels and think about memory 
in terms of them. Thus, the distinctions among encoding, storage, and retrieval do 
not belong to, or characterize, a single theoretical orientation, or model. 

It has become very fashionable to speak of encoding, storage, and retrieval as 
the essential components of the overall memory system. The distinctions among 
these components have been enormously helpful in stimulating and clarifying our 
thinking about some very complicated mental events. At the same time you 
should be aware that these components have not been separated or, for that 
matter, even defined to everyone’s satisfaction. Whether these three components 
truly represent distinct, independent components of the overall memory system 
remains to be seen. 

Murdock (1974) has characterized the three components about as well as 
anyone could. Encoding refers to the process by which the nervous system 
develops a representation of an external stimulus. The stimulus is a physical 
object or event that acts on, or has an effect on, the nervous system. This 
relatively lasting effect is referred to as the internal code for the external object or 
event. The nervous system now contains an encoded representation of the stimulus. 
Storage refers to the persistence of encoded material over time. Murdock 
draws a parallel between the storage process and the tape-recording of a musical 
composition. For all practical purposes the information stored on the tape does 
not change. Human memory rarely possesses the same degree of fidelity, but it 
does involve the storage of information over time. Retrieval, according to Murdock, 
refers to the utilization of this stored information. Such information may be 
available (stored) but not accessible (easily located and used). For example, we 
may know (have stored) the names of all the states of the United States, but at any 
given moment we may not be able to retrieve all of them. Hence, the information is 
available but not accessible. It is stored, but we may have trouble retrieving it. 

Figure 9.1 General form of encoding, storage, and retrieval in memory. 

Murdock (1974) points out that, although the terms encoding, storage, and 
retrieval have become popular in recent years, similar concepts have been discussed 
for decades (McGeoch & Irion, 1952; Melton, 1963). Figure 9.1 is a pictorial 
representation of the relationships among encoding, storage, and retrieval. 

THE INFORMATION-PROCESSING APPROACH 
New ways of thinking about memory were stimulated by the advent of the 
computer age. Whereas theorizing about memory before 1960 was pretty much 
restricted to talking about stimuli, responses, and associations between them, the 
language of computers has since become commonplace. Mathematical models 
and computer-simulation models are on the upsurge. Psychologists have begun to 
try to describe memory in terms of information processing and have conceptualized 
human memory in terms of elements such as flow charts, outputs, control 
programs, buffers, instructions, execution, processing, and storage. 

Following are a few representative examples of this information-processing 
approach. Many models are available to choose from, and most of them are new. 
Many of them are vague, and they are often untested. We cannot be sure which 
ones will stand the test of time and which ones will fade from view. Although 
many of them appear to account for a good portion of the available experimental 
data, none of them has emerged as “the” model. 

The discussion in this chapter will take the following form. First, we shall 
consider several separate-storage models. It is characteristic of these models to 
propose two or more separate and distinct memory stores. For instance, many of 
them distinguish between short-term store (STS) and long-term store (LTS). 
Second, we shall consider the levels-of-processing approach to the understanding 
of memory. This approach differs from the separate-store approach in that it does 
not think of the overall memory system as being composed of separate stores, or 
“boxes,” into which and out of which information is processed. Rather, as we 
have seen, the levels-of-processing approach argues that the durability of a 
memory trace is determined by the “depth” at which it is processed. The levels-of- 
processing approach is still an information-processing approach. But it is more of 
an approach than a theory or model, and it does not postulate separate stores. 
Third, we shall discuss several varieties of the newest form of model building, 
which can be labeled the semantic-network approach. According to this approach, 
memory is best thought of in terms of complex structures, or networks, of 
interrelated information. 


SEPARATE-STORE MODELS 
Primary and Secondary Memory 

Let us begin our discussion of separate-store models with a relatively simple 
conception of memory. The model proposed by Waugh and Norman (1965) is 
given in Figure 9.2. It represents a relatively early attempt to distinguish between 
two types of memory and to couch a description of these two stores in 
information-processing language. 

According to Waugh and Norman (1965), every item (stimulus) that is perceived 
enters primary memory (PM). Once in PM, an item will be lost, or 
forgotten, unless it is rehearsed. Rehearsal can be overt or covert, intentional or 
unintentional, conscious or unconscious. If an item is rehearsed, it remains in PM 
and may enter secondary memory (SM). Secondary memory is considered to be a 
more permanent store. Once in SM an item need not be rehearsed to be maintained. 
Waugh and Norman were trying to capture the following kind of event or 
experience. Suppose that you are presented with a series of digits (say, 87391063). 
If you read off this series and then prevent yourself from saying it over and over, 
you will forget it quite quickly (probably within a minute). But if you rehearse it, 
either silently or aloud, you can maintain it, you will not forget it, and you will be 
able to pronounce it on demand. If you rehearse it long enough, it will enter SM. 
In contrast to items in PM, items in SM are not lost when they are not rehearsed. 
If we rehearse the example sequence of digits enough times, it will become a 
relatively permanent memory, or enter our SM. We will be able to retrieve it even 
after it has left consciousness. 

The capacity of PM is limited. That is, there is a limit to the number of items we 
can hold in our PM. If we are given a string of dozens of digits, we probably will 
not be able to hold them all in our working memory. But most people can handle 
5, 6, 7, 8, or 9 items without too much difficulty. The exact capacity of primary 
memory is unknown (Glanzer & Razel, 1974). 

New items will “bump out” old ones. Suppose that we have been presented a 
series of eight words and are having a little trouble rehearsing them without 
forgetting any of them. Now the experimenter presents additional words. According 
to Waugh and Norman, something has to go. If we are to take in the new 
words, then we must drop, or lose, a corresponding number of old items that we 
have been holding in PM. Old items will be lost as we try to accommodate the 
new ones. We cannot handle an unlimited number of items in our PM. 
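
The rehearsal and "bumping out" ideas in the Waugh and Norman account can be made concrete with a small sketch. The Python fragment below is an illustration only and is not part of the original model. The class name, the capacity values, the rule that an item reaches SM after three rehearsals, and the rule that a rehearsed item counts as recent again are all assumptions chosen for the example.

```python
from collections import OrderedDict


class PrimaryMemory:
    """Toy limited-capacity store; capacity and transfer rule are assumed."""

    def __init__(self, capacity=7, rehearsals_to_transfer=3):
        self.capacity = capacity
        self.rehearsals_to_transfer = rehearsals_to_transfer
        self.items = OrderedDict()   # item -> rehearsal count, oldest first
        self.secondary = set()       # stands in for secondary memory (SM)

    def perceive(self, item):
        """Every perceived item enters PM; a full PM loses its oldest item."""
        displaced = None
        if item not in self.items and len(self.items) >= self.capacity:
            displaced, _ = self.items.popitem(last=False)
        self.items[item] = self.items.get(item, 0)
        self.items.move_to_end(item)
        return displaced

    def rehearse(self, item):
        """Rehearsal keeps an item in PM and may copy it into SM."""
        if item not in self.items:
            return False
        self.items[item] += 1
        self.items.move_to_end(item)  # assumed: rehearsal makes the item recent
        if self.items[item] >= self.rehearsals_to_transfer:
            self.secondary.add(item)
        return True


pm = PrimaryMemory(capacity=3)
for digit in "873":
    pm.perceive(digit)
for _ in range(3):
    pm.rehearse("8")
bumped = pm.perceive("9")   # PM is full, so the oldest unrehearsed item goes
```

In this run the digit 8 is rehearsed often enough to be copied into SM, while the unrehearsed 7 is the item displaced when 9 arrives. How much rehearsal transfer really requires is one of the questions the model leaves open.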

An event in PM has not left consciousness. Primary memory is part of the 
psychological present. In contrast, an item recalled from SM has been absent from 
consciousness. Secondary memory represents the psychological past. According 
to Waugh and Norman (1965), PM contains a relatively accurate record of 
recently perceived events. Material stored in SM may be distorted and full of gaps. 

Figure 9.2 The Waugh-Norman memory model. (After Waugh, N. C., and Norman, 
D. A. Primary memory. Psychological Review, 1965, 72, 89-104. Fig. 2, p. 93. 
Copyright 1965 by the American Psychological Association. Reprinted by permission.) 

Obviously, this type of model is far from complete. Many questions are left 
unanswered. For example, how much rehearsal is required for different types of 
material to pass from PM to SM? Can some information pass directly through 
PM into SM without being rehearsed at all (for example, “traumatic events”)? 
How are materials brought back from SM to PM? Waugh and Norman never 
intended their model to be complete and perfect. They presented it as a beginning, 
or approach, to the understanding of human memory. It is more a way of thinking 
about memory than a detailed statement of all the processes involved in memory. 

One aspect of the Waugh and Norman model deserves special attention, 
because it is characteristic of many of the current information-processing models. 
The authors postulate two memory stores, one for materials retained for short 
periods of time and one for materials to be retained for longer periods of time. 
This distinction is the essence of the separate-storage model. The dichotomy 
between short-term store (STS) and long-term store (LTS) (newer terms for PM 
and SM) has become one of the focal differences between the information-processing 
approach and the older two-factor theory of interference. If you will 
recall, interference theory makes no mention of any distinction between short-term 
store and long-term store. There is only one memory system. Materials can 
be learned to different degrees, thereby becoming more or less resistant to 
interference, but memory over short periods is not conceived of as fundamentally 
different from memory operating over longer periods. 

The proposed dichotomy between LTS and STS suggests that the principles and 
mechanisms that govern the retention of materials for short periods are fundamentally 
different from those that govern the retention of materials for longer 
periods. We shall discuss the validity of this controversial dichotomy in detail in 
Chapter 10. 


The Atkinson—Shiffrin Buffer Model 

The Waugh—Norman model serves as a simple introduction to the realm of model 
building. It possesses the basic dichotomy between PM and SM, which is inherent 
in so many current separate-store models. But the model developed by Atkinson 
and Shiffrin (1965, 1968, 1971) provides a more full-blown example of the 
information-processing trend. 

A brief glance at the flow chart in Figure 9.3 will reveal certain characteristics 
that appear to be similar to those of the Waugh and Norman (1965) model. For 
example, the model contains a box labeled “short-term store” and a box labeled 
“long-term store.” 

Essentially, the concepts of short-term store (STS) and long-term store (LTS) 
introduced by Atkinson and Shiffrin are the same as the Waugh—Norman concepts 
of primary and secondary memory. But notice that the Atkinson—Shiffrin 
model introduces a new store, a third box called the “sensory register.” This third 
component of the overall memory system precedes STS. The model thus expands 
the number of stores from two to three. 

The principal characteristic of the sensory register is that information stored 
within it decays in a very brief period of time. The exact length of this decay 
interval is not known. It has generally been estimated at about .5 of a second, 
although some investigators have proposed slightly longer or shorter estimates. In 
contrast to STS, where information is often thought to be lost within, say, 5, 10, 
15, or perhaps 30 seconds, unrehearsed information in the sensory register is 
seldom thought to persist much longer than .5 of a second. The evidence leading 
to the postulation of the sensory register will be reviewed shortly. 

Although the Atkinson—Shiffrin concept of STS is very similar to the Waugh— 
Norman notion of primary memory, Atkinson and Shiffrin have expanded the 
idea a bit more than did Waugh and Norman (see Atkinson, Brelsford, & Shiffrin, 
1967; Loftus, 1971; Murdock, 1972). In addition to adding the sensory register 
they elaborated on the short-term store by postulating a rehearsal buffer (see 
Figure 9.3). This buffer is composed of a certain number of slots, or bins. These 
are locations within which item information is stored. The number of these slots 
is definitely limited (that is, the buffer is composed of r slots). In addition, the bins 
are ordered. There is a definite sequence to them. The order is conveyed by the 
notation r, r - 1, r - 2, ..., 1. Slot r holds the newest information in the buffer, 
whereas Slot 1 holds the oldest, or the item information that has been in the 
buffer for the longest period of time. 

Because there is a fixed number of slots in the buffer, the inclusion of a new bit 
of information can be achieved only through the displacement of an old one 
(given that all the slots are full). New items are assumed to come in at the top, or 
to fill the r slot. Which item will be lost when a new item enters Slot r? In a very 
general sense, we can say that the older an item, the more likely it is to be 
displaced by the inclusion of a new item. Obviously this statement is an oversimplification. 
Atkinson and his colleagues were well aware that the displacement of 
an item is a complicated event (see Phillips, Shiffrin, & Atkinson, 1967). 

Figure 9.3 A version of the Atkinson-Shiffrin memory model. (After Atkinson, R. C., and 
Shiffrin, R. M. The control of short-term memory. Scientific American, August 1971, pp. 
82—90. Copyright © 1971 by Scientific American, Inc. All rights reserved.) 

Without going into any great detail we can grasp the complexity of item 
displacement by imagining a boy picking up pebbles on the beach. His hand will 
hold just so many stones. Each time he picks up a new one, he must, if he wishes 
to keep it, drop one of the old ones. Which one will he drop? Just imagine what 
goes into this “simple” decision. It is true that there may be a tendency to drop the 
stone that has been held the longest. But the boy will also probably consider 
which pebbles he likes the most and which ones he has examined closely. In an 
analogous manner the process of dropping items from STS is probably multiply 
determined. 

Be that as it may, the system does postulate an additional mechanism concerning 
the displacement effect. Once an item is displaced from, say, a slot near the 
bottom of the column of slots, what happens to the information in the remaining 
slots? According to the theory, all items older than the displaced item hold their 
place. They stay in the same slots. All items newer than the displaced item move 
down one slot. Obviously this postulation complicates matters, for it requires that 
older and newer items be given separate consideration. 

Figure 9.4 A summary separate-store model of memory. 

In summary, the Atkinson—Shiffrin model is similar to the earlier Waugh- 
Norman model, but it is more complex. Specifically, it adds the third box, the 
sensory register, and it expands on the functioning of STS, or PM. 
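
The slot-shifting rule described for the rehearsal buffer can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' own formulation. The function name, the list representation, and the default choice of Slot 1 as the displaced slot are hypothetical. The theory as summarized here says only that older items are more likely to be displaced, so the slot to empty is left as a parameter.

```python
def enter_buffer(slots, new_item, r, displaced_slot=1):
    """Return (new_slots, displaced_item) after new_item enters the buffer.

    slots[0] is Slot 1 (oldest) and slots[-1] is the newest occupied slot.
    displaced_slot is the 1-based slot to empty when the buffer is full;
    defaulting to Slot 1 is an assumption made for this sketch.
    """
    slots = list(slots)
    if len(slots) > r:
        raise ValueError("buffer holds more items than it has slots")
    displaced = None
    if len(slots) == r:
        if not 1 <= displaced_slot <= r:
            raise ValueError("displaced_slot must be between 1 and r")
        # Items older than the displaced one keep their slots; removing it
        # shifts every newer item down by one slot.
        displaced = slots.pop(displaced_slot - 1)
    slots.append(new_item)   # the new item becomes the newest entry
    return slots, displaced


buffer_before = ["A", "B", "C", "D"]            # Slots 1 to 4, so r = 4
buffer_after, lost = enter_buffer(buffer_before, "E", r=4, displaced_slot=2)
# buffer_after == ["A", "C", "D", "E"], lost == "B"
```

Here item A, which is older than the displaced item B, stays in Slot 1. Items C and D each move down one slot, and the new item E enters at the top.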

A Summary Separate-Store Model 

Having briefly looked at the Waugh—Norman and Atkinson—Shiffrin separate- 
store models, including some of the similarities and differences between them, it is 
time to consider a general, summary model of the separate-store variety. Such a 
model is shown in Figure 9.4. 

In the following sections in this chapter we shall look at the major components 
of this summary model and try to do two things. First, we shall look at the general 
characteristics of each component as they can be distilled from the literature. 
Second, we shall look at some of the data and evidence that have been marshaled 
in support of these components. Beginning at the left of Figure 9.4, we shall 
consider each of the separate-store components individually. 

Sensory Memory: Iconic and Echoic 

In this section we discuss what Atkinson and Shiffrin have called the sensory 
register, or what others have called sensory memory. Basically, sensory memory is 
thought of as a memory store with a very brief duration period, something like .5 
of a second. It is supposed to contain very basic, unelaborated impressions of the 
external environment, which decay rapidly unless processed into one of the other 
stores (that is, STS and LTS). 

Most research has focused on visual and auditory sensory memory, although 
there may be sensory-memory systems associated with the other senses. There 
have been a few investigations of sensory memory in the minor senses such as 
touch, taste, and smell (see Hill & Bliss, 1968), but we shall restrict our attention 
to visual and auditory sensory memory. If you follow the instructions in Box 9.1, 
you may get a better idea of what a sensory memory is. 


Box 9.1 Examples of Sensory Memory 

Visual sensory memory. Take a flashlight into a dark room and turn it on. Swing 
your wrist around in a circular motion, shining the flashlight onto a distant wall. If 
your motion is quick enough, you will see a complete circle. Your visual sensory 
memory stores the beginning of the circle while you examine the end of the circle. 

Auditory sensory memory. Take your hands and beat a quick rhythm on the 
desk. Can you still hear the echo after the beating is finished? 

Tactile (touch) sensory memory. Take the palms of your hands and quickly rub 
them along a horizontal edge of your desk, moving your hands so that the heel 
touches first and the fingertips touch last. Can you still feel the sharp edge, even 
after your hand is off the desk? 

From Cognition by Margaret Matlin. Copyright © 1983 by CBS College Publishing. Reprinted by permission of Holt, Rinehart 
and Winston, CBS College Publishing. 


ICONIC MEMORY. Early evidence for the existence of a visual sensory memory, 
sometimes called iconic memory, came from the work of Averbach and Coriell 
(1961) and Sperling (1960, 1963). What Sperling (1960) showed was that, when 
exposed to a visual array for a very brief, single presentation, people can remember 
a great deal more than was formerly believed to be the case, at least for a brief 
period. It was this work that led to the postulation of the visual sensory memory, 
or the memory that takes in a great deal of visual information in a very accurate 
manner but cannot hold that information for very long. 

Sperling used a tachistoscope, which is a piece of equipment that will present 
visual stimuli for very, very brief periods. He presented rectangular arrangements 
of letters such as those on the left side of Figure 9.5. These arrays were flashed for 
50 milliseconds (1/20th of a second). The subjects then were asked to report what 
they had seen. Without any special instructions they were able to report no more 
than four or five letters correctly. Now comes the interesting part. By using a 
prearranged signal, which was presented immediately after presentation of the 
letter array, Sperling asked the subjects to recall a specific row. A high-pitched 
tone meant that the subjects were to recall the top row. Low and intermediate 
tones indicated to the subjects that they were to recall the bottom or middle row, 
respectively. The surprising result was that the subjects could recall any given row 
with 100% accuracy. This finding suggests that there is a very rapidly decaying 
“visual image,” “afterimage,” or “visual sensation” that persists beyond the 
termination of the actual stimulus. It is as though the subjects “see” the entire 
array of letters for a moment after the actual physical stimulus has been terminated. 
It is there for them to read off. But this “afterimage” is not there for 
long. When asked to report the top row, the subjects can do so without error. 
They merely refer to the “visual image,” or whatever you wish to call it, and read 
off the correct letters. But while they are reading off, or encoding, the top row, the 
other rows are fading rapidly. By the time they have finished reading off the top 
row, the lower rows have faded to the point at which the subjects can no longer 
report them. The Sperling test procedure forces the subjects to attend to some 
portion of the total array. While they are attending to this selected portion, the 
entire array is decaying, rendering a report of the remaining items impossible. 

Figure 9.5 Sperling’s demonstration of visual sensory memory. Step 1: The 
experimenter presents the array for 1/20 second. Step 2: A tone signals which row 
the subject is to recall (high tone, top row; medium tone, middle row; low tone, 
bottom row). Step 3: The subject tries to recall the correct row. For example, a 
high tone signals the subject to recall X G O B. 

This is a rather interesting result. It suggests that incoming visual information is 
present as a rapidly decaying “visual image.” If encoded, or read off from this 
visual sensation, the information moves into STS. If it is not attended to almost 
immediately, it will be lost. Thus, in the separate-store model it seems to be 
attention that moves information from the sensory register into STS. Those few 
stimuli that we can attend to, before they fade away, will be processed into STS 
(see Figure 9.4). The rest will be gone, and unless they are presented again, they 
will never be remembered (see also Sakitt & Appelman, 1978). 
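
The logic of the whole-report and cued-row procedures can be sketched in Python. The sketch rests on assumptions that are not part of Sperling's report: the icon is treated as lasting a fixed 500 milliseconds, each letter is assumed to take 100 milliseconds to read off, and all function names and the letters in the lower two rows are made up for illustration.

```python
def letters_read(icon_ms=500, read_ms_per_letter=100):
    """How many letters can be read off before the icon fades (assumed values)."""
    return icon_ms // read_ms_per_letter


def whole_report(array, **timing):
    """Read the array row by row until the icon has faded."""
    flat = [letter for row in array for letter in row]
    return flat[:letters_read(**timing)]


def partial_report(array, cued_row, **timing):
    """Read only the row named by the tone, starting as soon as it sounds."""
    return array[cued_row][:letters_read(**timing)]


array = [list("XGOB"), list("TMLR"), list("VASF")]
whole = whole_report(array)                 # ['X', 'G', 'O', 'B', 'T']
bottom = partial_report(array, cued_row=2)  # ['V', 'A', 'S', 'F']

# The subject cannot know in advance which row will be cued, so a complete
# report of any cued row implies that every row was briefly available.
estimated_available = len(bottom) * len(array)   # 12
```

Under these assumed timings the whole report stalls at five letters, yet any single cued row is reported in full. That is the pattern that led to the idea of a large but rapidly fading visual store.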

ECHOIC MEMORY: STIMULUS-SUFFIX EFFECT AND PAS. Given that Sperling’s 
(1960) experiment demonstrates the existence of a visual sensory memory, what 
evidence is there for auditory sensory memory? This form of very short-term 
memory is often referred to as echoic memory, presumably because like an echo it 
involves a faithful, but short-lived, trace of auditory stimuli (see Massaro, 1970, 
1972; Neisser, 1967). 

Much of the evidence for echoic memory comes from the current work being 
done on the stimulus-suffix effect (Crowder, 1974, 1978; Greenberg & Engle, 
1983; Kallman & Massaro, 1983; Madden & Bastian, 1977; Penney, 1979; 
Watkins & Todres, 1979). The stimulus-suffix effect is a curious one. Subjects read 
a list of words and then try to recall them in order. If the last word in the list is 
followed by a redundant word, which is a word that follows all lists and need not 
be recalled, then the last word in the list is difficult to recall. For example, suppose 
that you are told you will be hearing a list of words and that following the list you 
will hear the word jump. You are told that you do not need to recall jump and 
that jump will follow other lists that you will be trying to recall; it will be 
redundant in the experiment. Somehow the inclusion of the word jump, a 
redundant suffix, disrupts your recall of the last, or the last few, words in the list. 

Let us say that you are asked to listen to and recall 20 lists of words. Half of 
these lists are followed by a redundant suffix such as the word jump. The 
remaining lists are not. Over the entire series of lists your recall of the last word in 
the lists containing the suffix will be lower than your recall of the last word in the 
lists containing no suffix. Why? Crowder has suggested that this suffix effect 
occurs because of the existence of an auditory sensory memory that he calls 
precategorical acoustic storage, or PAS. Precategorical acoustic storage is presumed 
to be a sound-based store that can retain a few sounds for very brief 
periods. By adding that last suffix we are overloading PAS, or bumping out the 
last word(s) in the list. Investigators in this area talk about the redundant suffix 
“overwriting” or “displacing” the last few items in the serial list that are being 
held in PAS. Without a suffix the last few words in the list go into PAS and are 
maintained there long enough to be recalled very well on the serial recall test. But 
when a suffix is added, it takes the place of the last word in the serial list and 
thereby reduces our ability to recall the bumped, or displaced, last serial word. 

Notice that PAS is thought of as a store with a very limited capacity. It can hold 
only a few items, and the addition of even one more item, such as a redundant 
suffix, bumps out old items. This is in contrast to the current conception of the 
visual sensory memory, which holds that large amounts of visual stimulation are 
held for brief periods of time. Thus, even though iconic and echoic memory are 
both thought of as sensory memories, they do differ, with the former possessing a 
larger capacity than the latter. 
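
The "overwriting" account of the suffix effect can be illustrated with a very small store that drops its oldest sound whenever a new one arrives. The Python sketch below is hypothetical. The capacity of one item is an assumption chosen to match the case in which the suffix takes the place of the last list word, and the word list is invented for the example.

```python
from collections import deque


def pas_contents(heard, capacity=1):
    """Sounds still held in PAS after hearing `heard` in order.

    A deque with maxlen drops its oldest entry when a new one arrives,
    which stands in for "overwriting." The capacity is an assumed value.
    """
    store = deque(maxlen=capacity)
    for sound in heard:
        store.append(sound)
    return list(store)


word_list = ["lamp", "river", "chair", "stone"]
no_suffix = pas_contents(word_list)                 # ['stone']
with_suffix = pas_contents(word_list + ["jump"])    # ['jump']
last_word_supported = word_list[-1] in with_suffix  # False
```

Without the suffix the final word is still in the store at recall. With the suffix, the store holds only the redundant word, so the final list word loses whatever support PAS would have given it.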

Do not think that the suffix effect and the postulated PAS are accepted by all 
investigators. There is a good deal of controversy surrounding these elements, and 
the exact nature of both the effect and the proposed PAS is under close scrutiny 
(see Ayres, Jonides, Reitman, Egan, & Howard, 1979; Baddeley & Hull, 1979; 
Parkinson, 1978; Richardson, 1979). 

Short-Term Store 

For years, the investigation of short-term memory, or short-term store, dominated 
the information-processing approach to memory. Recently, more and more attention 
has been devoted to the examination of sensory memory and long-term 
memory. But a huge quantity of data concerned with short-term memory processes 
has accumulated. Hence, this section will only serve as an introduction to the 
area of short-term memory. We shall be referring to short-term memory in 
connection with many different concerns throughout the remainder of this text. 
For now, our purpose is to get a rough view of short-term store. 

As we have already seen, short-term store, or STS, is characterized as a 
limited-capacity system, which handles information that is to be stored for 
relatively short periods. At present the time limit imposed on STS seems to be 
fairly arbitrary. Many investigators tend to think of STS as a system in which 
information will be lost within 30 seconds or so unless it is processed further. 

Items in STS can be maintained through rehearsal. If items are not rehearsed, 
they will be lost from STS (Anderson & Craik, 1974). Thus, there may well be 
two ways in which items are lost from STS. First, old items may be displaced by 
new, incoming items. Second, items may be lost if they are not rehearsed. 

Item rehearsal does something in addition to maintaining information in STS. 
Namely, rehearsal of information held in STS may contribute to its transfer to 
LTS. Notice that rehearsal may contribute to transfer. It does not necessarily lead 
to transfer to LTS (Jacoby & Bartz, 1972). Items may be rehearsed in STS 
without being transferred to LTS, but for items to be transferred from STS to LTS 
they must be rehearsed (Atkinson & Shiffrin, 1971). 

In general, then, the longer an item has been maintained in STS by rehearsal, 
the more likely it is to be transferred to LTS. The longer we hold information (for 
example, the sequence BXGBJKL) in STS by rehearsing it, the more likely it is to 
become part of our “permanent” memory. 
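
One simple way to picture this relationship, offered only as a sketch, is to suppose that each rehearsal gives an item a small, fixed, independent chance of being transferred to LTS. Neither the independence assumption nor the value of 0.1 comes from the separate-store literature; both are hypothetical and are used here only to show a curve that rises with rehearsal but never guarantees transfer.

```python
def transfer_probability(rehearsals, p_per_rehearsal=0.1):
    """Chance that an item has reached LTS after some number of rehearsals.

    Assumes each rehearsal independently transfers the item with a fixed
    probability; both the independence and the 0.1 value are hypothetical.
    """
    if rehearsals < 0:
        raise ValueError("rehearsals must be non-negative")
    if not 0.0 <= p_per_rehearsal <= 1.0:
        raise ValueError("p_per_rehearsal must lie between 0 and 1")
    return 1.0 - (1.0 - p_per_rehearsal) ** rehearsals


# Probability of transfer after 0, 5, 10, 15, and 20 rehearsals.
curve = [transfer_probability(n) for n in range(0, 21, 5)]
```

The values in `curve` increase with the number of rehearsals and stay below 1, which matches the claim that rehearsal may contribute to transfer without necessarily producing it.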

An item can be in both STS and LTS at the same time. For example, think of 
your mother’s name. When you think of it, it will be in both STS and LTS. An 
item can be in LTS but not STS. For example, all of the things we “know” but are 
not now thinking of are in LTS but not STS. An item can be in STS but not LTS. 
For example, NELZIPRELLY is now in your STS. But it probably will not enter 
your LTS unless, for some peculiar reason, you rehearse it several times (but see 
Brown & Kulik, 1977).

We shall return to a consideration of STS in the next chapter. At that time we 
shall consider the validity of the construct. We shall ask whether it is even 
necessary to distinguish between STS and LTS, and we shall trace some of the 
experimental data that bear on this issue. 

Long-Term Store 

Until recently, long-term store, or memory over long periods, did not receive as 
much attention as short-term store within the information-processing tradition. 
Although interference theory has been interested in LTS for many years,
information-processing theoreticians focused first on short-term memory. But things have
changed rapidly in the last few years, and there have been many new and exciting 
developments in the area of LTS. We shall be discussing many of these LTS issues 
in upcoming chapters, particularly Chapters 10 and 11. 

For now, our purpose is to look at the general dimensions of what is commonly 
referred to as LTS and at its relationships to sensory memory and STS. Detailed 
analyses and new developments will be considered later. 

Generally speaking, information is presumed to enter LTS from STS through 
the process of rehearsal. Once information is in LTS, it is no longer rehearsed 
(unless it is brought back to STS). Information is often assumed to be lost from 
LTS through the combined effects of both decay and interference. The decay rate 
is presumably much slower than that of STS. Although information is often 
thought to decay from STS in a matter of seconds, estimates of the LTS decay rate 
range up to years. Once in LTS, information is presumed to be stored quite 
permanently. (The exact values of the decay rates associated with the sensory 
register, STS, and LTS are not particularly important at this point in our
investigations. In fact, psychologists are not even sure that decay necessarily operates in all
of these stores.) 

Let us consider the proposition that LTS is affected by interference factors. 
When information-processing theoreticians speak of interference effects in LTS, 
many are presumably referring to such factors as proactive and retroactive 
inhibition and the associated concepts of unlearning and response competition. 
Thus, the information-processing models often acknowledge and attempt to 
incorporate the interference mechanisms outlined in the last chapter. There has 
been a tendency to locate these interference effects in LTS and to leave STS and 
sensory memory relatively free of them. In fact, one of the most controversial 
distinctions between STS and LTS has been based on the notion that information
in STS is not subject to interference effects. Many psychologists have raised 
questions concerning the validity of this distinction, as we shall learn in the next 
chapter. 

Table 9.1 Commonly Accepted Differences Among the Three Stages of Verbal Memory

| FEATURE | SENSORY REGISTER | SHORT-TERM STORE | LONG-TERM STORE |
|---|---|---|---|
| Entry of information | Preattentive | Requires attention | Rehearsal |
| Maintenance of information | Not possible | Continued attention, rehearsal | Repetition, organization |
| Format of information | Literal copy of input | Phonemic, probably visual, probably semantic | Largely semantic, some auditory and visual |
| Capacity | Large | Small | No known limit |
| Information loss | Decay | Displacement, possibly decay | Possibly no loss; loss of accessibility or discriminability by interference |
| Trace duration | .25 to 2 seconds | Up to 30 seconds | Minutes to years |
| Retrieval | Readout | Probably automatic; items in consciousness; temporal/phonemic cues | Retrieval cues, possibly search process |

Adapted from Craik, F. I. M., and Lockhart, R. S. Levels of Processing: A Framework for Memory Research. Journal of Verbal
Learning and Verbal Behavior, 1972, 11, 671-684. Copyright 1972 by Academic Press, Inc. Reprinted by permission.


In summary, the separate-store models propose that information is transferred 
from sensory memory to STS through a process of attention. Otherwise, information
decays from sensory memory in a fraction of a second. The short-term store
has a limited capacity. Information is transferred from STS to LTS through a
process of rehearsal, and unless it is rehearsed, such information may be lost from 
STS in a matter of seconds. Information may also be lost from STS through a 
process of displacement. Information stored in LTS, in contrast, is quite permanent.
It is subject to a very slow decay process and to interference effects. Thus,
not everything that enters the sensory register reaches STS, not everything that 
enters STS is transferred to LTS, but information that does make it all the way 
through to LTS is considered to be relatively permanent. 
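
One way to make these proposals concrete is to write them down as a small program. The following Python sketch is only an illustration under stated assumptions; it is not a model proposed by any of the researchers cited here. The class name, the seven-item capacity, and the per-rehearsal transfer probability are all hypothetical choices made for the example. The sketch shows attention admitting items to STS, displacement of the oldest item when STS is full, loss of unrehearsed items, and rehearsal that may (but need not) copy an item into LTS.

```python
import random
from collections import deque


class SeparateStoreModel:
    """Toy sketch of a separate-store (sensory register -> STS -> LTS) model."""

    def __init__(self, sts_capacity=7, transfer_prob=0.2, seed=0):
        # sts_capacity and transfer_prob are illustrative assumptions,
        # not empirical estimates.
        self.sts = deque()            # oldest item sits on the left
        self.lts = set()
        self.sts_capacity = sts_capacity
        self.transfer_prob = transfer_prob
        self.rng = random.Random(seed)

    def attend(self, item):
        """Attention moves an item into STS; returns any displaced item."""
        if item in self.sts:
            return None
        displaced = None
        if len(self.sts) >= self.sts_capacity:
            displaced = self.sts.popleft()   # oldest item is displaced
        self.sts.append(item)
        return displaced

    def rehearse(self, item):
        """Rehearsal may copy an item held in STS into LTS.

        Returns True if the item is in LTS after this rehearsal.
        """
        if item not in self.sts:
            return False                     # only items in STS can be rehearsed
        if self.rng.random() < self.transfer_prob:
            self.lts.add(item)
        return item in self.lts

    def drop_unrehearsed(self, rehearsed):
        """Items that were not rehearsed are lost from STS (LTS is untouched)."""
        self.sts = deque(i for i in self.sts if i in rehearsed)


def chance_in_lts(n_rehearsals, transfer_prob):
    """Chance that at least one of n independent rehearsals led to transfer."""
    return 1 - (1 - transfer_prob) ** n_rehearsals
```

Under these assumptions the chance that an item has reached LTS grows with the number of rehearsals, which mirrors the claim that longer maintenance in STS makes transfer more likely without guaranteeing it.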

Table 9.1 summarizes the differences among the three memory systems we have 
been discussing, as envisioned by Craik and Lockhart (1972). Again keep in mind 
that the values contained in Table 9.1 are only rough estimates. Different
researchers will assign different values to some of these factors. Not enough is
known to be certain of the exact values. 

Control Processes and Structural Properties 
Atkinson and Shiffrin (1968, 1971) distinguish between the fixed, structural 
properties of the overall memory system and control processes. The distinction is 
an important one. Control processes refer to the fact that the individual can go 
about using the fixed structural properties of the system in many different ways. 
For example, STS has the capacity to hold a certain amount of information. This 
capacity represents a fixed, structural property of the system. But whether we 
choose to fill STS is another matter entirely. We do not have to fill it; we can 
partially fill it or leave it empty. Just because someone flashes a lot of words in 
front of us does not mean we have to put them in our STS. We do, after all, have 
some choice in the matter. In addition, we can maintain information in STS or 
“let it go.” We can transfer information to LTS or refrain from doing so. We can
pull information back from LTS into STS and so on. The structural properties of 
the system set limits on what we can do. For example, we cannot hold an 
unlimited amount of information in STS. But within the limits of its structural 
properties we can “use our memory.” These choices, decisions, and strategies are 
what Atkinson and Shiffrin call control processes. 

It is simple to see that the separate-store model becomes very complicated when 
we begin to consider control processes: We have a willful, unpredictable human 
being on our hands. We must consider all his or her needs, wants, wishes, and
decision processes. At the same time, the introduction of control processes
immediately makes the model much more appealing and much more closely related
to the world of human experience. 

Atkinson and Shiffrin place considerable emphasis on control processes and 
their relation to the fixed structure of the model (see also Hinrichs & Grunke, 
1975). They have many interesting things to say about control processes, and you 
are referred to their work for a full explanation of the concept. For the purpose of 
this text we will consider a couple of examples of the effects of human strategies 
on the use of the system. These examples are merely representative of many 
investigations and should be understood in that light. (In a sense, most of Chapter 
11, in which we discuss organizational strategies in memory, is concerned with 
control processes. So we shall be having a great deal more to say about control 
processes as the text progresses.) 

The first example of control processes is a study done by Bellezza and Walker 
(1974). This interesting experiment is based on the assumption that what we do 
with information within the memory system depends on what we want to do with 
it, or feel we need to do with it. Accordingly, one group of subjects was presented 
with a list of seven words and asked to recall all the words immediately following 
presentation. The subjects were then presented with another list of words and 
asked for immediate recall of that list, and so on, until seven different lists had 
been presented and tested. Before any lists were presented the subjects were told 
that they would receive “1 point” for every word recalled correctly. These 
subjects were free to rehearse in such a manner as to maintain each list in STS. 
There was no need to try to transfer the information to LTS, for they believed that 
they would be given only an immediate-recall test. There was no need to store the 
information permanently, for storage in STS would be more than adequate to 
maintain the material through the immediate-retention test. However, these
subjects were “fooled” by the experimenter. Following the presentation and testing
of all seven lists, the experimenter asked the subjects to try to recall all of the 
words from all seven lists in one final retention test. The subjects were told that 
they would be given “10 points” for each correct item on this final recall test. 
Obviously, these subjects were not ready for this development. They had not 
tried to transfer information from STS to LTS in preparation for this final test. 
They did very well on the immediate tests but very poorly on the final test. The 
assumption here is that, by the time the final test arrived, materials had been lost 
from STS. In addition, they had not been put into LTS, hence the poor
performance on the final test.

Figure 9.6 Number of words recalled per list in the storage (o) and coding (•) conditions.
(Adapted from Bellezza, F. S., and Walker, R. J. Storage-coding trade-off in short-term
store. Journal of Experimental Psychology, 1974, 102, 629-633. Copyright 1974 by the
American Psychological Association. Reprinted by permission.)

Another group of subjects was treated in exactly the same manner, except that
these subjects were told before being exposed to any of the lists that they would
be tested for immediate retention and that they would be given the final recall
test. Correct items in immediate recall were to receive “1 point,” whereas correct 
items in final recall would receive “10 points.” If you were placed in a similar 
situation, you would probably try to ensure that you would be able to remember 
as much as possible on that final test, because that is where the larger payoff 
would be. You would rehearse in such a way as to maximize transfer from STS to 
LTS. You would not especially care about what stayed in STS so long as you were 
sure lots of information was going into LTS. The results of this strategy are
apparent in Figure 9.6. As you can see, the storage condition (told only about the
immediate tests) did well on the immediate tests and poorly on the final test. The
coding condition (told about both immediate and final recall) did better than the 
storage subjects on the final test but not so well as that group on the immediate 
tests. Because of their awareness of the big, 10-point payoff the coding subjects 
utilized the fixed structural properties of the memory system such that storage in 
LTS was maximized. The effort that these subjects put into transferring materials 
to LTS seems to have somehow detracted from their maintenance of information 
in STS. We can only speculate about the reasons for this poor performance in STS 
by the coding subjects. For example, they may have merely rehearsed fewer of the 
presented items a greater number of times to ensure that they would be
transferred to LTS.

What the study demonstrates is that, depending on the needs of the subject, the 
memory system can be used in different ways to achieve different ends. It also 
suggests that rehearsal is not a unitary process. That is, there appear to be certain 
types of rehearsal that maximize storage in STS and other rehearsal strategies that 
maximize storage in LTS (Craik & Watkins, 1973; Jacoby, 1973; Woodward &
Bjork, 1973). 

If you will recall, we discussed Type I (maintenance) rehearsal and Type II 
(coding) rehearsal in Chapter 5. Type I rehearsal is the kind used by the subject to 
keep information “in mind,” or in STS. Type II rehearsal is what we use to 
transfer information from STS to LTS (see Woodward, Bjork, & Jongeward, 
1973). Although Bellezza and Walker did not use the Type I and Type II labels, 
the distinction between these two forms of rehearsal is the basis of their study. 

The distinction between control processes and the structural properties of the 
memory system is becoming ingrained in our ways of thinking about memory, 
and it forms the impetus for a good deal of research. For example, Peterson, 
Thomas, and Johnson (1977) point out that for certain retention tests it might be 
best to try to remember items in terms of visual images, whereas other retention 
tests might be best attacked through the use of verbal rehearsal (see also Rundus, 
1977). 

CULTURAL DIFFERENCES IN CONTROL PROCESSES. Wagner (1978) has done some 
cross-cultural work and come to the conclusion that the structural properties of 
the human memory system are universal. That is, all members of the species (for 
we are, after all, members of the same species) possess the same structural 
properties. But the control processes used in different countries and cultures 
probably do vary. 


DIRECTED FORGETTING. Bjork’s (1970, 1972) work on directed forgetting
represents another good example of the operation of control processes in memory. In
one situation he presented four successive paired associates. Color background 
cues instructed the subjects either to forget the first two items or to retain them for 
possible future testing. Bjork found that retention of the last two items was 
superior when the subjects had been informed that they need not try to maintain 
the first two. He speculates that the “forget” cues initiate control processes which 
free rehearsal time for the last two items and which somehow differentiate the 
to-be-remembered items from the to-be-forgotten items (see also Bjork &
Geiselman, 1978).

The distinction between control processes and structural properties need not be 
limited to the separate-store models that we have been considering, even though 
the distinction was originally developed in connection with the Atkinson-Shiffrin 
model. It is a general distinction that can be useful no matter what model is under 
consideration. We can begin to see the kind of fun that psychologists can have 
with such a model. Propositions based on the interaction of the structure of the 
system and the control processes can be tested in the laboratory and related to 
the world outside the laboratory. The control processes “put the human back” in 
the system. 


THE LEVELS-OF-PROCESSING APPROACH 
Introduction 

The preceding sections concerned with separate-store models of memory should 
convey the fact that there is not much agreement about exactly how many and 
what kind of stores, or boxes, we should include in our formulation of the
memory process. But the controversy is not limited to disagreement among 
separate-storage modelers. Specifically, the levels-of-processing approach
challenges the entire notion of separate stores. According to this viewpoint, which
was introduced in Chapter 5, the durability of a memory trace is determined by 
the “depth” to which it is processed. The deeper the processing, the more durable 
is the trace. As you will recall, if the structural properties of a piece of information 
(for example, what it looks like) are attended to, then processing will be relatively 
shallow, and the trace will be weak. But if the acoustic, or phonemic, properties 
(what it sounds like) are attended to, the memory trace will be more durable. 
Finally, if the semantic properties (for example, the meaning of a word) are 
attended to, then processing is at the deepest level, and the memory trace will be 
most durable. (Some of the tasks that have been used to produce different levels of
processing are listed in Table 5.1.)

This approach, which is not really a theory at all, does not postulate separate 
stores. It argues that we can think of a continuum of memory strengths, which is 
determined by level of processing, from shallow (weak) to deep (strong). It says 
nothing at all about discrete boxes. In this sense it is closer to the interference 
approach, in that it conceives of memory as a unitary process in which the 
strength of a memory trace can be increased through rehearsal. But even though it 
is similar to interference theory in this one sense, we should keep in mind that the 
levels-of-processing approach is really an information-processing approach. 
Thus, it has something in common with both the interference and separate-store 
conceptions of memory (see Baddeley, 1978; Glanzer & Koppenaal, 1977).

The basic experimental design used to study levels of processing is an incidental 
learning paradigm. As noted, Craik and Tulving (1975) had subjects make
judgments about words presented for very, very brief moments. In the
shallow-processing condition the subjects judged whether the words were typed in capital
letters. In a second condition the subjects judged whether the stimulus word 
rhymed with a designated word. Finally, in the deepest level of processing the 
subjects judged whether the stimulus word fit into a sentence frame. Then, in a 
surprise recall test it was found that recall increased as depth of processing 
increased. 

There have been many variations on this theme of incidental learning. A 
number of experiments supporting the levels-of-processing hypothesis were
described in detail in Chapter 5. Generally speaking, whatever the particular form
of processing required by the experimenter, it has been found repeatedly that
retention increases as depth of processing increases (Bellezza, Cheesman, &
Reddy, 1977; Cermak & Reale, 1978; Craik & Lockhart, 1972; Hunt & Mitchell,
1978; McDowall, 1979; Moeser, 1983; Smith, Theodor, & Franklin, 1983).

But in spite of its popularity the levels-of-processing approach has been severely 
criticized. As noted in Chapter 5, foremost among these criticisms has been the 
assertion that deep processing does not always or necessarily lead to greater trace 
durability. In addition, information processed on a shallow level can be quite 
durable (see Kolers, 1976; Morris, Bransford, & Franks, 1977; Nelson, 1977;
Nelson & Vining, 1979). 

Furthermore, Nelson (1977) argues that there is a circularity to the levels 
approach, because we have no independent measures of what we mean by 
“depth.” First we say that, if processing is deep, retention will be better. Then we
say that, if recall was better, processing must have been deep. 

Baddeley (1978) feels that the levels approach can be used as a general rule of 
thumb but that there are too many exceptions to the rule to utilize it in any hard 
and fast manner (see Hunt, Elliott, & Spence, 1979). For example, Eysenck and 
Eysenck (1979) report some interesting results having to do with the amount of 
effort it requires to pull information back into consciousness after it has been 
processed to different levels. First, they had subjects process words at different 
depths. Then they had them try to do two things at once: recall the words and 
press a lever when a sound occurred. They found that the lever-pressing latency 
went up (it took them longer to press the lever in response to the sound) when 
subjects were trying to recall deeply processed and heavily elaborated words. In 
other words, trying to recall deeply processed words “used up” more of the 
subjects’ capacity to process information than did the retrieval of words
processed at a shallow level.

This is an interesting result because, whereas most other data indicate that deep 
processing leads to greater trace durability, these authors report that retrieving 
deeply processed material requires more attention and effort than does the 
retrieval of shallow-processed information (see also McDaniel, Friedman, &
Bourne, 1978). In other words, a great deal of the data are consistent with the 
levels approach, but not all of them. 

Evidence for Elaboration and Distinctiveness 

In Chapter 5 the concepts of elaboration and distinctiveness were introduced in 
connection with efforts to understand why deeper processing leads to better 
retention (see Walker & Jones, 1983). If you will recall, the elaboration
hypothesis maintains that semantic processing leads to richer, more elaborate encoding,
which in turn facilitates retention. The distinctiveness hypothesis holds that 
deeper processing helps make the to-be-remembered stimulus more distinct from 
other stored information and, therefore, more recallable. Although Chapter 5 
described the distinctiveness and elaboration hypotheses, it did not actually 
present any data in support of them. It is to that task that we now turn. 

Let’s first look at two experiments that support the elaboration hypothesis. 
Klein and Saltz (1976) ran a typical levels-of-processing experiment in which 
subjects first made semantic judgments about stimuli and were then tested for 
recall of those words in a surprise test. But the interesting variation in their 
experiment was that some subjects made only one judgment about the stimuli (for 
example, Is this stimulus pleasant or unpleasant?), and other subjects made two 
judgments about the same stimuli (for example, Is this stimulus pleasant or 
unpleasant, and is it big or little?). In agreement with the elaboration hypothesis, 
words rated on two dimensions were recalled more often than words rated on a 
single dimension. 

Craik and Tulving (1975) developed a second means of increasing semantic 
elaboration. Subjects were unexpectedly asked to recall words after they had 
decided whether those words would fit into a sentence frame provided by the 
experimenters. The crucial aspect of the study was that there were three different 
levels of sentence-frame complexity, as follows: 
Simple: She cooked the ____.

Medium: The ripe ____ tasted delicious.

Complex: The small lady angrily picked up the red ____.

The results showed, among other things, that recall of the correct words went up 
as the degree of sentence-frame complexity went up, especially when subjects 
were provided the frames as cues, or prompts. These results support the
elaboration hypothesis.

Now what about experimental support for the distinctiveness hypothesis? Are 
more distinctive units recalled more readily than less distinctive ones? The 
answer seems to be yes. Words that have distinctive, unusual shapes (lymph,
khaki, afghan) are said to be orthographically distinct. Hunt and Elliott (1980) 
have shown that these words are better recalled than more commonly shaped, less 
orthographically distinct words such as leaky, kennel, and airway. There is no 
doubt that it is the peculiar shape of these words that leads to their better recall. If 
the words are presented auditorily, there is no difference in recall of the two types. 
Furthermore, if all words are presented in capital letters, then LYMPH, KHAKI, 
and AFGHAN are no better recalled than LEAKY, KENNEL, and AIRWAY. 
The different heights of lower-case letters presumably create the distinctiveness 
that leads to better recall (see also Moscovitch & Craik, 1976). 

In summary, the original levels-of-processing conception of memory has been 
modified and expanded by the recent emphasis on elaboration and distinctiveness,
and it seems that both elaboration and distinctiveness can lead to better
recall. 


Encoding Specificity 

Another development relative to the levels-of-processing approach is best
introduced by an experiment. In an encoding-specificity experiment (see Thomson &
Tulving, 1970) all subjects are shown a list of words that they will later try to 
recall. Some subjects see only this list, and others see the list plus a word related to 
each of the to-be-recalled words. For example, one subject might see wood alone, 
and another subject might see the to-be-recalled word wood plus tree. Then, at
the time of recall, half of the subjects in each group are given tree and asked to
recall wood; the remaining subjects are not shown tree when attempting to recall
wood.

The results were these: Subjects who saw tree both at the time of learning and 
at the time of recall did best in recalling wood. That was not surprising. But what 
was surprising was that the group that never saw tree was better at recalling 
wood than were the groups that saw tree either at the time of learning or at the 
time of recall. 

This kind of experiment demonstrates what Tulving and his associates call the 
encoding-specificity principle: Recall will be best when the cues present at the time
of learning and the time of recall are most alike. If you learn wood when tree is 
present, you will best be able to recall wood if tree is present at the time of recall. 
But if tree is not present during the learning of wood, then wood will best be 
recalled in the absence of tree. To maximize retrieval, cues present at input should 
also be present at recall. 

The levels-of-processing approach emphasizes encoding processes in memory;
it states that our ability to remember material will be determined by how we 
encode that material. The encoding-specificity principle extends the analysis to 
include the idea that our ability to remember material will also be affected by 
retrieval conditions as well as encoding processes. For example, suppose we 
encode a given item to a certain depth. Recall of that item will not be completely 
determined by that depth of processing; it will also be affected by what cues are 
present at the time of recall. If the cues at recall are the same as those that were 
present during learning, then recall will be good. If cue conditions during learning 
and recall differ greatly, recall will suffer. 
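
The wood and tree experiment described above can be restated as a small scoring exercise. The Python sketch below is an assumption-laden illustration, not a procedure taken from Tulving and his associates: the function name, the made-up "list" context cue, and the scoring rule (shared cues minus mismatched cues) are all hypothetical. The rule was chosen only because it reproduces the ordering of the groups reported in the text, with matching cue conditions scoring higher than mismatched ones.

```python
def cue_match_score(study_cues, test_cues):
    """Hypothetical score: cues shared by study and test, minus mismatched cues."""
    study, test = set(study_cues), set(test_cues)
    shared = study & test          # cues present on both occasions
    mismatched = study ^ test      # cues present on only one occasion
    return len(shared) - len(mismatched)


# "list" stands for the general experimental context, which is an assumed cue
# present for every subject; "tree" is the related word from the text.
conditions = {
    "tree at learning and recall": ({"list", "tree"}, {"list", "tree"}),
    "tree never shown":            ({"list"},         {"list"}),
    "tree at learning only":       ({"list", "tree"}, {"list"}),
    "tree at recall only":         ({"list"},         {"list", "tree"}),
}

scores = {name: cue_match_score(study, test)
          for name, (study, test) in conditions.items()}
```

The numbers themselves mean nothing. Only their order matters: the two matched conditions score above the two mismatched ones, which is the pattern the encoding-specificity principle describes.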

SEMANTIC-NETWORK MODELS 
Common Characteristics 

Having considered separate-store models and the levels-of-processing approach,
we now turn to the third, newest, and last type of memory model to be discussed
in this chapter: semantic-network models. The phrase refers to quite a wide
range of theoretical efforts, but most of these models have at least two attributes 
in common. First, they deal with the storage of semantic (meaningful) material. In 
preceding sections we have sometimes discussed the retention of relatively 
meaningless materials (such as nonsense syllables) that have been learned in the 
laboratory. Now we are going to concentrate on memory for meaningful
materials such as words, phrases, sentences, and prose. We are beginning the study of
our knowledge about language and the meaningful world around us. 

Second, semantic-network models assume that stored knowledge is best 
thought of in terms of multiple interconnected associations, relationships, or 
pathways. Instead of thinking of single stimuli being connected to single
responses (the old S-R conception), semantic-network models think of a unit of
information as being embedded in an organized, structured network composed of 
other units and their functional relationships to one another. As we shall see, the 
exact nature of these functioning networks varies depending on the type of 
semantic-network model we are considering. 

Our emphasis in this section will be on lexical memory, or memory for words.
A lexicon is a group of ordered words. For example, a dictionary is a lexicon. So 
when we speak of lexical memory, we are focusing on our store of words (our 
vocabulary) as distinct from grammar, sentence memories, experiences, or any 
other kind of stored information. When we study lexical memory, we want to 
know how words are encoded, stored, and retrieved (see Chambers, 1979; Clark 
& Gerrig, 1983; Glanzer & Ehrenreich, 1979; Taft, 1979). Later, in Chapter 13,
we shall see how semantic-network models also help us understand memory for 
more complex forms of information such as sentences, prose, and general
knowledge. In this chapter we restrict our attention for the most part to the simplest
applications of network models by focusing on lexical memory. We now turn to 
the first of four semantic-network models. 

Hierarchies 

A number of authors have proposed that the best way to think about the structure 
of lexical memory is in terms of hierarchies. Collins and Quillian (1972) have 
developed an interesting hierarchical conception of the lexicon (see Figure 9.7). 

Figure 9.7 A memory model involving a three-level semantic network, or hierarchy.
(Adapted from Collins, A. M., and Quillian, M. R. Retrieval time from semantic memory.
Journal of Verbal Learning and Verbal Behavior, 1969, 8, 240-247. Fig. 1, p. 241.)


The concept animal is broken down into birds and fish, which, in turn, are 
broken down further. Characteristics of a given word are stored with that word. 
But notice also that, if a given word at a given level has a certain characteristic, all 
other words below that word in the hierarchy also automatically possess that 
same characteristic. Thus, if we enter the network at fish and note that fish can 
swim, we know that all word entries below fish (on the same branch of the 
network) also can swim (that is, sharks and salmon can swim). 

Notice how efficient this storage system is. If we enter at the bottom of the 
hierarchy (for example, ostrich), we immediately know quite a bit about
ostriches. We know not only what is stored at that point (long legs, tall, cannot 
fly) but many other things as well. By moving up in the hierarchy we find that 
ostriches have wings and feathers. Still further up we find that ostriches eat, 
breathe, have skin, and can move. This network is efficient because not every 
attribute of a given word need be stored with that word. For example, the 
information has skin does not have to be repeatedly stored with each of the 
animals in the network. It is stored only once but is accessible from all points in 
the network. 

The next question is whether this conception of lexical memory predicts 
observed events. Although far from being completely validated, the hierarchy 
does seem to be consistent with certain data. For example, it predicts that certain 
kinds of questions should take longer to answer than others. Consider the
following questions:

1. Do canaries sing? 

2. Do canaries have wings? 

3. Do canaries have skin? 

Which of these questions do you think would be most quickly answered? 
Collins and Quillian predicted that the first question would be most quickly
answered, because sing is stored directly with canary. Next, they predicted that
question 2 would be answered more slowly than the first question but not so
slowly as the third question, because wings is stored one level higher in the
hierarchy than canary, whereas skin is stored two levels above canary. These
predictions were confirmed experimentally. According to the
model, it takes time for activation to move from one level to another. The further 
apart two items are in the hierarchy, the longer it takes for a decision to be made 
about the relationship between them. Thus, there is at least some evidence for the 
usefulness of a hierarchical conception of memory. 
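
The reasoning behind the prediction can be made concrete by counting levels. The sketch below assumes a three-node fragment (canary, bird, animal) and treats the number of levels climbed as a stand-in for decision time; the model itself does not specify any particular numbers.

```python
# Assumed fragment: where each attribute is stored relative to canary.
PARENT = {"canary": "bird", "bird": "animal", "animal": None}
STORED = {"canary": {"sing"}, "bird": {"wings"}, "animal": {"skin"}}

def levels_up(word, attribute):
    """Count the levels climbed before the attribute is found.

    Returns None if the attribute is not found anywhere above the word.
    """
    steps = 0
    while word is not None:
        if attribute in STORED.get(word, set()):
            return steps
        word = PARENT[word]
        steps += 1
    return None

for attribute in ("sing", "wings", "skin"):
    print(attribute, levels_up("canary", attribute))
```

The counts come out as 0, 1, and 2, which is the predicted ordering of the three questions from fastest to slowest.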

As a further example of the hierarchical position we will discuss an approach 
taken by Lindsay and Norman (1972). You should be aware that the available 
models differ considerably and that this is but one example chosen from many. 
Let us begin with a simple concept, such as tavern. We all have this concept stored 
within us, and it is not stored in isolation. It is related in many ways to many 
other bits of stored information. How might we characterize the relations 
between this bit of information (tavern) and the rest of the information that is 
stored within us? In attempting to characterize these relationships, or to define the 
position held by tavern amid an enormous amount of stored information, Lindsay 
and Norman distinguish among “classes,” “examples,” and “properties.” 
Thus, tavern belongs to a certain class, which we can designate as establishments. 
Tavern is one example of an establishment. Tavern also has certain properties. 
For example, it holds beer and wine. Beer and wine are properties that we have 
come to associate with tavern. Finally, there are certain examples of the concept 
tavern (for example, Luigi’s). A picture, or diagram, of the semantic network that 
might be associated with the concept tavern is given in Figure 9.8. This network 
defines the relationships among tavern and its various classes, properties, and 
examples. Hilgard and Bower (1975) discuss the network as follows: 

A realistic memory, of course, contains thousands of such concepts, each with 
very many connections, so that the actual topographical representation would 
look like a huge “wiring diagram.” But a fantastic amount of information 
is inherently encoded in such graph structures. To see just a hint of this, 
consider the fragment of a semantic network surrounding the concept 
of a tavern [as shown in Figure 9.8]. This graph implicitly encodes the 
information that a tavern is a kind of business establishment (as is a 
drugstore), which has beer and wine, and Luigi’s is an example of a tavern. It 
also gives some properties of beer, wine, and Luigi’s. This is only a fragment, 
of course, and much more information could and would be in a realistic 
memory. But notice how very many questions one is enabled to answer with 
just this fragment. For example, it can answer questions that require chains 
of subset relations, such as that “Luigi’s is an establishment” or that “A 
drugstore is a place.” It can also read out the properties or classes that any 
two concepts have in common. Thus, if we ask the system to compare the 
similarities and differences of beer and wine, it would quickly find that the 
similarities are that they are both beverages sold at taverns, but one is made 
from fermented grain while the other is made from fermented fruit. The 
number of factual relationships derivable and possible questions that can be 
answered increases exponentially as the number of encoded predicates or 
“bits of knowledge” increases. (pp. 594-595) 

Figure 9.8 Fragment of a semantic network surrounding the concept tavern. (Adapted 
from Lindsay, P. H., and Norman, D. A. Human information processing: An 
introduction to psychology. New York: Academic Press, 1972. Fig. 10.5, p. 389.) 
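
The two kinds of question mentioned in the passage (chains of subset relations, and comparison of two concepts) can be illustrated with a small sketch. The links below are assumptions reconstructed from the prose; they are not a copy of Figure 9.8, and the function names are hypothetical.

```python
# Class links ("is a kind of" / "is an example of"), assumed from the prose.
ISA = {
    "Luigi's": "tavern",
    "tavern": "establishment",
    "drugstore": "establishment",
    "establishment": "place",
    "beer": "beverage",
    "wine": "beverage",
}

# Property links, again assumed from the prose.
PROPERTIES = {
    "tavern": {"has beer", "has wine"},
    "beer": {"made from fermented grain"},
    "wine": {"made from fermented fruit"},
}

def classes_of(concept):
    """Follow class links upward and return every class reached, in order."""
    chain = []
    while concept in ISA:
        concept = ISA[concept]
        chain.append(concept)
    return chain

def is_a(concept, category):
    """Answer a question that requires a chain of subset relations."""
    return category in classes_of(concept)

def compare(a, b):
    """Read out shared classes and the properties that differ."""
    shared = [c for c in classes_of(a) if c in classes_of(b)]
    props_a = PROPERTIES.get(a, set())
    props_b = PROPERTIES.get(b, set())
    return {"shared": shared, "only_a": props_a - props_b, "only_b": props_b - props_a}

print(is_a("Luigi's", "establishment"))
print(is_a("drugstore", "place"))
print(compare("beer", "wine"))
```

Neither “Luigi’s is an establishment” nor “A drugstore is a place” is stored directly; both are derived by following links, which is the point the quotation makes about implicitly encoded information.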

It should be obvious at this point that any attempt to characterize the existing 
structure, or organization, of memory is difficult. The preceding example seems 
somewhat complicated. But when we realize that it represents nothing more than 
a very rough description of a very tiny segment of the total amount of information 
stored within each of us, then we can begin to grasp the magnitude of the task 
before us. Modeling of this sort is in its infancy. If they are to be complete, such 
models must, of necessity, become more and more elaborate. Can you imagine 
attempting to draw up blueprints, or networks, that capture all of the overlapping 
and interrelated information stored within but a single individual? Putting a man 
on the moon was probably simple compared with this task. More than likely, 
models of this sort will never be complete. Rather, they will probably serve a 
heuristic purpose, stimulating thinking and suggesting questions. 

Matrices 

Although many investigators think of lexical memory in terms of hierarchies 
containing categories and instances of those categories, others think of the lexicon 
in terms of matrices. Figure 9.9 contains a sample matrix (and, for comparison, a 
sample hierarchy). 

Broadbent, Cooper, and Broadbent (1978) compared the hierarchical scheme 
and the matrix scheme. Their basic technique was to present the same list of 
words in either a hierarchy, a matrix, or a random order and then test for recall. 
They reasoned that, if we naturally use a hierarchical scheme when we store 
words, then hierarchical presentation of the 16 words should facilitate recall 
relative to matrix or random presentation. What they found was that both the 
hierarchy and the matrix facilitated recall relative to the random control but that 
they did not differ from each other. Words presented in the matrix form were 
remembered about as well as words presented in the hierarchy form. 

Organization helped, but it made little difference which form of organization 
was used. Broadbent, Cooper, and Broadbent (1978) speculate that the four 
groups of items were what were important. They suggest that one item in a group 
stimulating one other item in the group (for example, dog and cat) may have been 
more important than the generating structure, or the cue words (printed in capital 
letters in Figure 9.9). 

The question of whether information is stored in hierarchies or matrices 
remains unanswered. The answer may be somewhat more complicated than a 
simple choice of one over the other. Some information may best be stored in 
matrices, whereas hierarchical storage may be more suitable for other kinds of 
information. Huttenlocher and Lui (1979), for example, suggest that nouns may 
be stored in hierarchies, whereas verbs tend to be stored in matrices. 

Feature Models 

The third approach to the understanding of lexical memory is often termed the 
feature (or sometimes the attribute or property) approach (see Smith, Shoben, & 
Rips, 1974). In general, this approach is concerned with how we categorize nouns 
when they are presented to us. For example, if canary is presented to us, what 
determines how long it will take us to decide whether canary belongs to the 
category bird? In the hierarchy and matrix approaches just described, the quickness 
of this decision is determined by such things as how close the target and 
prime are to each other in some arrangement of words. In the feature approach 
the emphasis is not on any notion of spatial arrangement. Rather, the emphasis is 
on the number of features, or attributes, that canary and bird have in common. 

Figure 9.9 Two ways of thinking about lexical memory: (a) hierarchy and (b) matrix. 
(Adapted from Broadbent, D. E., Cooper, P. J., and Broadbent, M. H. P. A comparison of 
hierarchical and matrix retrieval schemes in recall. Journal of Experimental Psychology: 
Human Learning and Memory, 1978, 4, 486-497. Fig. 1, p. 487. Copyright 1978 by the 
American Psychological Association. Reprinted by permission.) 

Hampton (1979) distinguishes between defining features and characteristic 
features. Defining features, on the one hand, provide necessary and sufficient 
criteria for making a decision about category membership. Characteristic features, 
on the other hand, are typical attributes of items belonging to the category, but 
they are not sufficient in and of themselves for determining category membership. 
For example, it is characteristic of birds to fly. But animals other than birds can 
fly, and some birds do not fly. Thus, flying is characteristic of birds, but not 
defining. On the other hand, birds are animals with feathers. These are defining 
characteristics, for if an object is an animal with feathers, it must be a bird. 

Hampton says that we reach a decision about whether a noun belongs to a 
category in two steps. First, we determine how many attributes an item has in 
common with the category. If there is a very large number of common attributes 
or a very small number of common attributes, then our decision is made swiftly 
and can be made on the basis of characteristic attributes. But if the overlap 
between category and item appears to be intermediate, then we become more 
careful, and we compare only defining attributes of the category with the attributes 
of the instance. This takes a little longer, or produces a slower decision 
process (see also McCloskey & Glucksberg, 1979). 
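
The two-step decision can be sketched as follows. This is a loose illustration under stated assumptions, not Hampton's own formulation: the feature lists are invented for the example, overlap is measured as a simple proportion of the category's features, and the cutoffs 0.3 and 0.7 are arbitrary.

```python
def category_decision(item, characteristic, defining, low=0.3, high=0.7):
    """Return (is_member, steps_used) for a two-step category decision.

    Step 1 measures overall feature overlap. Very high or very low overlap
    gives a fast answer. Intermediate overlap triggers step 2, in which only
    the defining features are checked. The cutoffs are arbitrary assumptions.
    """
    category = characteristic | defining
    overlap = len(item & category) / len(category)
    if overlap >= high:
        return True, 1
    if overlap <= low:
        return False, 1
    return defining <= item, 2

# Invented feature sets for the category bird.
BIRD_DEFINING = {"animal", "feathers"}
BIRD_CHARACTERISTIC = {"flies", "sings", "small", "builds nests"}

ITEMS = {
    "canary": {"animal", "feathers", "flies", "sings", "small", "builds nests"},
    "ostrich": {"animal", "feathers", "tall", "long legs", "builds nests"},
    "bat": {"animal", "flies", "small", "fur"},
    "chair": {"wooden", "legs"},
}

for name, features in ITEMS.items():
    print(name, category_decision(features, BIRD_CHARACTERISTIC, BIRD_DEFINING))
```

With these assumed features, canary and chair are settled in one step, whereas ostrich and bat have intermediate overlap and require the slower check of defining features.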

To this point we have discussed three ways of thinking about word storage 
(hierarchies, matrices, feature analyses). Although these approaches do differ, 
they also have one thing in common: They all assume that organization is crucial 
in memory. They represent the kinds of efforts that are being made in the 
structural, or organizational, approach to the study of memory (see also Rayner 
& Posnansky, 1978). 


Spreading-Activation Models 

There is a fourth kind of semantic-network model, called spreading-activation 
theory, which is extremely popular today. As we shall see, spreading-activation 
models have a good deal in common with other network models, but they 
introduce some new ideas as well. For example, these models are more general in 
that they are not limited to hierarchical relationships. They acknowledge that 
there are other kinds of semantic relationship, too. 

The original spreading-activation model, presented by Collins and Loftus 
(1975), is characterized in Figure 9.10. Notice first that the words stored in this 
network are interrelated in many more ways than a simple hierarchical configura¬ 
tion. Semantic storage is seen as being quite tangled and intertwined. Next, notice 
that the lines, or links between words, vary in length. For example, the link 
between red and orange is shorter than the link between red and roses. The 
shorter the link between two words, the stronger the semantic 
relationship between those two words. 

The central assumption of the model is that, when a word is processed (such as 
when a subject sees the word red), activation spreads out along the pathways of 
the network. This spreading activation, going out in all directions, is assumed to 
weaken as it travels outward. Although it is not defined precisely, activation refers 
to the idea that an activated unit can be more easily processed (recalled, retrieved, 
recognized, judged, evaluated) than an unactivated unit. 

The model makes two further assumptions. First, activation decreases over 
time. For example, if you present the word red, surrounding elements will be 
activated for a little while but not forever. Second, intervening activities will 
decrease activation. Thus, if you present street after presenting red, the activation 
emanating from red will decrease. 
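
A rough sketch may help show how these assumptions fit together. Collins and Loftus do not give activation a precise form, so everything numerical here is an assumption: the link lengths, the word flowers, the exponential fall-off with distance and with time, and the two rate parameters. The effect of intervening activity is not modeled.

```python
import heapq
import math

# Hypothetical link lengths; a shorter link means a closer semantic relationship.
LINKS = {
    ("red", "orange"): 1.0,
    ("red", "roses"): 2.0,
    ("roses", "flowers"): 1.0,
}

def neighbors(node):
    """Yield (other_node, link_length) for every link touching the node."""
    for (a, b), length in LINKS.items():
        if a == node:
            yield b, length
        elif b == node:
            yield a, length

def spread(source, elapsed=0.0, distance_rate=0.5, time_rate=0.2):
    """Return an activation level for every word reachable from the source.

    Activation weakens with path length through the network and with the
    time elapsed since the source word was processed. Both fall-offs are
    assumed to be exponential purely for illustration.
    """
    shortest = {source: 0.0}
    queue = [(0.0, source)]
    while queue:
        dist, node = heapq.heappop(queue)
        if dist > shortest[node]:
            continue  # stale queue entry
        for other, length in neighbors(node):
            candidate = dist + length
            if candidate < shortest.get(other, math.inf):
                shortest[other] = candidate
                heapq.heappush(queue, (candidate, other))
    fade = math.exp(-time_rate * elapsed)
    return {word: math.exp(-distance_rate * d) * fade for word, d in shortest.items()}

print(spread("red"))
print(spread("red", elapsed=5.0))
```

Under these assumptions, processing red leaves orange more activated than roses, and roses more activated than flowers; every level is lower when more time has elapsed.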

The spreading-activation model has had its greatest success in accounting for 
the priming effect in lexical decision experiments. First, the subjects are shown a 
prime, such as the word bird. Then the subjects are shown, for a very brief period, 
a target item, which may be a related word, such as canary, or any other string of 
letters (for example, house, preef). The subjects’ task is to make, as quickly as 
possible, some decision about the target item, such as deciding if it is a word, 
telling what the word is, or deciding if it belongs to a certain category. 

In general, it has been found that the speed of these decisions is increased if the 
prime is related to the target word. The closer the relationship, the faster is the 
decision (Macht & Spear, 1977; Massaro, Jones, Lipscomb, & Scholz, 1978; 
Neely, 1977; Sperber, McCauley, Ragain, & Weil, 1979). In other words, the time 
required for processing of the target item is reduced if the subjects are given 
information about the target item ahead of time in the form of a related prime. 
The usual interpretation is that the prime activates related material in the semantic 
network and makes it more readily processed. The longer you present the 
prime, the greater the spreading activation (Warren, 1977). 

Figure 9.10 Example of a spreading-activation model in which the length of each line 
(link) represents the degree of association between two concepts. (Adapted from Collins, 
A. M., and Loftus, E. F. A spreading activation theory of semantic processing. 
Psychological Review, 1975, 82, 407-428. Copyright 1975 by the American Psychological 
Association. Reprinted by permission.) 

Priming experiments are extremely popular these days (den Heyer, Briand, & 
Dannenberg, 1983; Henik, Friedrich, & Kellogg, 1983; Huttenlocher & Kubicek, 
1983; Kiger & Glass, 1983; Reder & Ross, 1983; Roediger, Neely, & Blaxton, 
1983; Stanovich & West, 1983). Most of these studies support and elaborate on 
the spreading-activation position. Some studies, however, seem to conflict with 
the model (see Neely, Schmidt, & Roediger, 1983). In addition, the model has 
been criticized for its lack of precision, for the great number of assumptions it 
makes, and for its inability to make crisp, clean predictions (McCloskey & 
Glucksberg, 1979). 

Essentially, we are going to have to wait for the dust to settle before a 
reasonable assessment of the spreading-activation position can be made. At 
present, it is just too controversial to evaluate with any confidence. Some feel that 
this approach represents the one true path of the future (see Claxton, 1978), 
whereas others believe that it is a remarkably weak endeavor (see Wexler, 1978). 

Nevertheless, research and theorizing go on. In particular, John Anderson 
(1983) has presented a spreading-activation theory of memory called ACT, which 
is an elaboration of an earlier version of the same theory (Anderson, 1976). ACT 
makes many of the same assumptions as Collins and Loftus (1975). But it is a 
much more general, mathematical, wide-ranging, and comprehensive model. 
Because ACT considers the storage of sentences, connected discourse, and knowledge 
in general, as well as lexical memory, a consideration of it will be delayed 
until Chapter 13. For now, it is enough to know that Anderson and others like 
him are hard at work tinkering with their models in an effort to extend our 
understanding of memory. 


SUMMARY 

1. Before the 1960s the two-factor theory of interference dominated the field of 
forgetting. 

2. The last 25 years have witnessed the rapid development of the 
information-processing approach to the understanding of memory. 

3. In contrast to earlier years, we now have a large number of available memory 
models. 

4. Encoding refers to the processes by which a representation of physical 
objects and events is developed by the nervous system. 

5. Storage refers to the persistence of information over time. 

6. Retrieval refers to the search for, and utilization of, stored information. 

7. Separate-store models propose two or more separate and distinct memory 
stores. 

8. According to the Waugh-Norman model, perceived items enter a 
limited-capacity primary memory (PM). Rehearsal will maintain items in PM and 
may contribute to their transfer to a more permanent secondary memory 
(SM). 

9. The introduction of new items into PM will bump out old ones. 

10. An item in PM has never left consciousness. Items recalled from SM have 
been absent from consciousness. 

11. The distinction between PM (or STS) and SM (or LTS) is characteristic of 
many current models. This dichotomy is in contrast to the unitary conception 
of memory suggested by interference theory. 

12. The Atkinson-Shiffrin model has three components: sensory register, 
short-term store (STS), and long-term store (LTS). 

13. Information in sensory memory decays within a fraction of a second. 

14. Information is transferred from sensory memory to STS through a process of 
attention. 

15. Visual sensory memory, or iconic memory, has been demonstrated by the 
work of Sperling. 

16. Auditory sensory memory, or echoic memory, is also thought to exist. The 
stimulus-suffix effect in serial learning is taken as evidence for this form of 
memory. Echoic memory is often called precategorical acoustic storage 
(PAS). 

17. Short-term store has a limited capacity. 

18. Unless rehearsed, information may be lost from STS in a matter of seconds. 
Information may also be lost from STS through a displacement process. 

19. Information is transferred from STS to LTS through rehearsal. 

20. Storage in LTS is relatively permanent. 

21. Information in LTS is subject to slow decay and interference. 

22. Control processes refer to the fact that the individual can go about using the 
fixed structural properties of the memory system in many different ways. 

23. As an example of control processes, subjects choose between alternative 
methods of rehearsing. Certain types of rehearsal maximize storage in STS, 
whereas other rehearsal strategies maximize storage in LTS. 

24. Structural properties may be universal, whereas control processes may vary 
across cultures. 

25. Directed forgetting represents an example of the utilization of control 
processes. 

26. The levels-of-processing approach argues against the separate-store 
approach. It maintains that durability of memory trace increases as depth of 
processing increases. 

27. An incidental-learning paradigm is used to study levels. Various forms of 
orientation toward stimulus materials determine depth of processing. 

28. Retrieval of deeply processed information may require more of our processing 
capacity than retrieval of shallowly processed material. 

29. The levels approach has been criticized because shallowly processed information 
can be well retained and because we have no independent measure of 
depth. 

30. The emphasis in the levels approach has shifted to a study of the elaboration 
hypothesis and the distinctiveness hypothesis. These two factors have been 
assumed by some to account for depth-of-processing effects. 

31. Encoding specificity refers to the fact that recall will be best when cues 
present at the time of recall and at the time of learning are most alike. 

32. Semantic-network models assume that the storage of meaningful material 
is best thought of in terms of multiple, interconnected associations or 
relationships. 

33. Lexical memory refers to that part of semantic memory involving words. 

34. Theorizing about lexical memory centers around hierarchical models and 
models based on matrices, features, and spreading activation. 

35. In hierarchical models words are assumed to be interconnected in 
hierarchies. Most hierarchies are efficient storage models, because not all 
information about each word has to be stored with that word. 

36. Presenting information in a matrix facilitates recall, but no better than does 
hierarchical presentation. 

37. Feature models emphasize the number of attributes that are common to a 
category and an instance whose membership in the category is to be judged. 

38. Defining features provide necessary and sufficient criteria for making a 
decision about category membership. Characteristic features are typical 
attributes but are not sufficient for defining category membership. 

39. When the number of common features is high or low, a decision is made 
quickly, and it can be made on the basis of characteristic features. But when 
the number of common attributes is intermediate, defining features are 
referred to, and the decision-making process is slowed. 

40. Spreading-activation models assume complex relationships among stored 
units of information. When a unit is processed, activation spreads out along 
the pathways of the network. 

41. Activation decreases with time and is reduced by intervening activity. 

42. Spreading activation accounts for the priming effect in lexical decision 
experiments. 

43. In lexical decision experiments a prime is first presented, and then a target 
item is presented. The subjects must make some decision about the target 
item, such as “Is it a word?” or “What is this word?” 

44. It has been found that the decision process is accelerated if the prime is 
related to the target and if the prime is presented for longer periods. 

45. The conclusion is that the prime activates related material in our lexical 
structure. 

46. ACT, Anderson’s spreading-activation model, is a comprehensive theory of 
the storage and retrieval of knowledge. 

INTRODUCTION 

Many divisions of memory have been proposed. Distinctions among processes 
and subprocesses can, and do, lead to controversy. The purpose of this chapter is 
to outline some of these important points of conflict in the study of memory. 

1. We shall consider the distinction between short-term and long-term store. 
Are the principles that govern memory over short intervals truly different 
from those that govern memory over longer intervals? 

2. We shall question the nature of the difference between recognition memory 
and recall memory. 

3. We shall discuss the popular distinction between memory for temporal and 
spatial events (episodic) and memory for knowledge and facts (semantic). 

4. We shall consider the parallels between animal and human memory. 

5. We shall examine the distinction that is often made between automatic and 
controlled processes in memory. 

6. We shall explore the effect of context, or state, on memory. 

7. We shall review split-brain research, which suggests that the two 
hemispheres of the brain tend to process different types of information. 

SEPARATE STORES? 

Interference Effects in STS 

We begin by addressing the distinction between long-term store (LTS) and 
short-term store (STS) that is so characteristic of information-processing models of 
memory. The postulation of two (or more) memory stores obviously requires that 
the stores differ in important ways. If we can identify one or more ways in which 
the principles governing the retention of information over short intervals differ 
from the principles governing the retention of information over longer periods, 
we may be able to justify the distinction. If we cannot, then we might as well give 
up the distinction as an intriguing but irrelevant idea. 

Let us begin our story by going back to 1959. At that time Peterson and 
Peterson developed what has come to be known as the distractor technique for 
the study of short-term memory. Subjects are first shown a stimulus (for example, 
three consonants) for a brief moment. They are then required to perform some 
distractor task, such as counting backward by 3s for a set number of seconds (for 
example, 3 or 18 seconds). The subjects are then required to recall the original 
three-letter stimulus. The distractor task (counting backward by 3s) is designed to 
keep the subjects from rehearsing the stimulus during the retention interval. If the 
subjects were not distracted in this manner, they would merely repeat the stimulus 
item over and over, thereby ensuring perfect recall. 

The overall Peterson and Peterson task sounds very simple. All we have to do is 
remember a group of three letters for a few seconds. If we are presented with 
XKZ, it seems highly unlikely that we would ever forget such a simple item within 
18 seconds. And yet the results of these experiments are quite remarkable. 
Retention of a simple three-unit stimulus drops to as little as 10% after 18 
seconds. That means we are not able to recall more than 1 out of 10 trigrams that 
we try to remember for only 18 seconds. A finding like that represents an 
enormous amount of forgetting over a very brief period of time. 

We can see how these results tie in with the notion of STS, or primary memory. 
It seems as though the information is lost or decays within a few seconds because 
it is not rehearsed enough to either maintain it within STS or transfer it to LTS. 
(The distractor task prevents the requisite rehearsal.) 

RI AND PI. The question arises as to what accounts for this dramatic and 
unexpected loss over so short an interval. At the time the effect was first observed, 
interference theory, with its emphasis on proactive inhibition (PI) and retroactive 
inhibition (RI), was in full swing. It had been clearly demonstrated that long-term 
retention losses were heavily determined by PI and RI. But this new short-term 
memory task represented something unique and unknown. Did PI and RI operate 
here just as they did in long-term memory? Could interference theory account for 
some or all of the enormous forgetting observed in this new procedure? At first, it 
seemed to some as though interference mechanisms could not account for the 
losses over such short intervals. It did not seem as though the loss could be an RI 
effect, because it did not appear as if the subjects were learning anything between 
the presentation of the syllable and its recall test that could cause unlearning of, 
or compete with, the syllable. In addition, data were presented suggesting that 
recall did not decrease as the number of prior syllables that the subjects learned 
and recalled increased (Peterson, 1963; Peterson & Peterson, 1959). That is, a 
significant PI effect was not observed. Psychologists began to suspect that interference 
mechanisms operated in LTS but not in STS. Short-term memory was 
perhaps free from interference effects, whereas long-term memory was heavily 
influenced by interference mechanisms. (The most common suggestions concerning 
the determining mechanisms in STS were decay and displacement.) 

But the proponents of interference theory were not long in responding to this 
potential schism. In an excellent study Keppel and Underwood (1962) outlined 
the problem and pursued a PI effect in the Peterson and Peterson (1959) situation. 
Using carefully controlled procedures they attempted to avoid methodological 
problems they identified in the original study. In contrast to the Petersons, they 
found that subjects’ ability to recall successive stimuli decreased as the number of 
previously learned and tested items increased (see Figure 10.1). They demonstrated 
that PI does, after all, operate in short-term memory. It was a dramatic 
success for the interference position, for it suggested that all retention, whether long 
or short, is subject to PI. It argued against the necessity of a distinction between 
long-term and short-term memory. 

Since the original Keppel and Underwood (1962) study PI has been found in 
short-term memory many times (Loess, 1964; Mendell, 1977; Nowaczyk, 
Shaughnessy, & Zimmerman, 1974; Wickens, 1970). In addition, subsequent 
research has indicated that RI, as well as PI, operates in short-term memory 
situations. For reports of RI in short-term memory the interested reader is 
referred to Posner and Rossman (1965) and to Posner and Konick (1966). 

Figure 10.1 Proactive inhibition in the Peterson-Peterson situation. Proportion of correct 
responses as a function of the number of previous items and length of retention interval. 
(Adapted from Keppel, G., and Underwood, B. J. Proactive inhibition in short-term 
retention of single items. Journal of Verbal Learning and Verbal Behavior, 1962, 1, 
153-161. Fig. 4, p. 158.) 

RELEASE FROM PI. There is one more area of research indicating that interference 
operates in a similar manner in both STS and LTS. We begin by noting that 
interference in LTS is heavily affected by similarity of the materials. We have 
already seen in previous chapters that similar materials often interfere with one 
another much more than do strongly dissimilar materials in both transfer and LTS 
situations. Now the question becomes this: Does similarity affect interference in 
STS in the same way? 

The release-from-PI effect indicates that the answer to this question is yes. 
Release from PI was originally demonstrated by Wickens, Born, and Allen (1963), 
but it has been demonstrated many times since then (see Bird, 1977; Russ-Eft, 
1979). Subjects are first given five trials of the Peterson and Peterson type, with 
the materials all being drawn from the same form class (for example, all words). 
As these trials progress, PI builds up. That is, performance on the succeeding trials 
decreases. Then, on the sixth trial, the Peterson and Peterson procedure is maintained, 
but the type of material presented is changed. The subjects might be 
presented a nonsense syllable instead of a word. This shift in the form class of the 
material produces a “release from PI”; performance on the sixth trial jumps 
dramatically (see Figure 10.2). The notion here is that interference in STS is 
affected by similarity in much the same manner that similarity affects interference 
in LTS; high similarity increases interference. 

In conclusion, it can no longer be said that a distinction between short-term 
memory and long-term memory can be made on the basis of the notion that 
short-term memory is free from interference effects. Short-term memory is subject 
to PI and RI. If we are to find a valid basis for the distinction between short-term 
and long-term memory, we must look elsewhere. 

Figure 10.2 Release from PI (hypothetical data). 

Acoustic Versus Semantic Coding 

Is there anything else that might distinguish short-term memory from long-term 
memory? Early research seemed to indicate that short-term memory might involve 
acoustic coding processes, whereas long-term memory might involve 
semantic coding processes (Conrad, 1964; Wickelgren, 1965). In a very general 
sense what this means is that we can store information in short-term memory on 
the basis of the sounds of the items, whereas we can store information in 
long-term memory on the basis of the meaning of the material. If we wish to store 
information for a short period, we may focus on the sounds of the to-be- 
remembered material. For example, many of us have had the experience of having 
been given a phone number (824-2105, say) in a crowded, noisy room. While 
frantically searching for pencil and paper, we repeat the number over and over, 
maintaining the sounds of the digits. We in no way attempt to deal with the 
material on any more complicated level. We do not try to group the digits or to 
relate them to our past life. We merely maintain the information through sound. 
But, if we wish to remember the same number for a long period, we may encode it 
in a more complicated fashion. For example, we might relate the digits to events 
in our life (8 divided by 2 is 4; 21 plus 5 is my age, and so on). In other words, 
we may involve meaning in our coding process. 

For a time the distinction between long-term memory and short-term memory 
based on semantic and acoustic coding processes seemed to be gaining wide 
acceptance (Adams, 1967; Baddeley, 1966a, 1966b; Baddeley & Patterson, 1971; 
Kintsch & Buschke, 1969). But the semantic-acoustic distinction has fallen on
hard times. Evidence began to accumulate that semantic coding might, after all,
be involved in short-term memory (Shulman, 1971, 1972). Similarly, evidence for
acoustic effects in long-term memory began to appear (Bruce & Crowley, 1970;
Dale & McGlaughlin, 1971; McGlaughlin & Dale, 1971; Nelson & Rothbart,
1972). It may well be that short-term memory often involves acoustic coding and 
that long-term memory characteristically involves semantic coding, but neither 
system is as restricted as Baddeley and others originally suggested. Depending on 
the demands of the task, we are capable of coding in many different ways. (For
further discussion see Forrester, 1972; Glassman, 1972; Jacoby & Goolkasian,
1973; Raser, 1969; Tell, 1972; Wickelgren, 1973.) Once again we are forced to 
look elsewhere for support for the distinction between short-term and long-term 
memory. 


Serial-Position Effects 

When we are presented with a list of items (a string of words, for example) and 
asked to recall them immediately in any order we wish, a very interesting pattern 
of results is obtained. These results are summarized by the curve in Figure 10.3. 
Notice that items toward the end of the list are recalled better than any of the 
others. This fact is called the recency effect. Items toward the beginning of the list 
are recalled more often than items in the middle, but not as well as the end items. 
This fact is called the primacy effect. Items in the middle of the list are the most 
difficult to recall. 

THE PRIMACY EFFECT. Let us first dispense with the primacy effect, which is of 
minor interest to us in attempting to distinguish short-term from long-term 
memory. The general assumption concerning the primacy effect is that it reflects a 
greater amount of rehearsal given to early items in the list. Early items are recalled 
fairly well because they are rehearsed more than later items (see Rundus, 1971). 
Put yourself in the position of rehearsing a serial list. One item is exposed at a 
time. When the first item appears, you rehearse it over and over as many times as 
you can before the second item appears. When the second item does appear, you 
probably rehearse from the beginning of the list, repeating items 1 and 2 over and 
over again. When item 3 appears, you still attempt to rehearse from the beginning 
of the list, and so on. If this rehearsal strategy is followed, it is clear that early 
items will be rehearsed more than later items. Hence, they are learned and 
recalled better than later items (Reynolds & Houston, 1964). 
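
The rehearsal strategy just described can be made concrete with a small counting exercise. The following Python sketch is offered only as an illustration: the function name, the fixed number of rehearsals available between items, and the strict cycling from the start of the list are all assumptions introduced here, not details reported by Rundus (1971) or by Reynolds and Houston (1964).

```python
def cumulative_rehearsal_counts(list_length, rehearsals_per_interval):
    """Count how often each item is rehearsed under cumulative rehearsal.

    After item k appears, the learner cycles through items 1..k from the
    beginning of the list, using a fixed number of rehearsals before the
    next item appears. Both the fixed budget and the strict cycling are
    simplifying assumptions.
    """
    counts = [0] * list_length
    for presented in range(1, list_length + 1):
        for slot in range(rehearsals_per_interval):
            # Rehearsal always restarts from the first item of the list.
            counts[slot % presented] += 1
    return counts


counts = cumulative_rehearsal_counts(list_length=6, rehearsals_per_interval=6)
print(counts)  # [16, 9, 5, 3, 2, 1]
```

Under these assumptions the first item collects far more rehearsals than the last, which is the pattern the rehearsal account of the primacy effect relies on.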

THE RECENCY EFFECT. Why are the last items recalled best of all? The rehearsal 
strategy just outlined cannot account for the recency effect. The most common 
interpretation of the recency effect (see Glanzer, 1972; Kintsch & Polson, 1979;
Murdock, 1974) involves the distinction between short-term and long-term memory.
During presentation, all items enter short-term memory. As additional items
are presented, some of the earlier items are lost. Some items are transferred to 
long-term memory. Immediately after all the items in the list have been presented, 
the items toward the end of the list are still in short-term memory. As a result 
these items have an extremely high probability of being recalled in an immediate 
retention test. Items further back in the list are no longer in short-term memory. 
Hence, relative to the end items, they have some lower probability of being 
recalled. The assumption is made that the recency effect is the result of the end 
items being stored in short-term memory at the time of recall. Lower recall of 
items further back in the list results from some of them having been lost completely
and some having been moved into long-term memory.

Figure 10.3 Idealized serial-position curve for free recall.

This is an interesting explanation, but it is not without its critics (see Brodie &
Murdock, 1977). As Wickelgren (1973) points out, there is no real need to
postulate two memory systems to account for the recency effect. A single-memory
model can account for it just as well as the two-memory model. Assume for the
moment that there is only one memory store. Information enters this store and is
then lost over time (owing to decay or interference). If we assume, as Wickelgren 
(1973) does, that this memory trace is lost rapidly at first and then more slowly, 
then the recency effect can be explained by a single memory model. The critical 
assumption in this explanation is the notion that memory traces decay, or are
lost, rapidly at first and then more slowly. It is the idea that forgetting will be most 
rapid immediately after learning and that the rate of forgetting will then be 
reduced as time goes on. Applied to the recency effect this position would argue 
that end items have not yet undergone this initial, substantial loss. Items further 
back in the list have already suffered the initial, substantial loss in strength. In 
summary, the recency effect can be understood in terms of a two-memory position
or in terms of a single-memory position involving a slowing rate of loss. The
recency effect does not prove that short-term memory is different from long-term 
memory. To the contrary, the distinction between short-term and long-term 
memory provides but one interpretation of the recency effect (see Watkins &
Peynircioglu, 1983). 
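
The single-memory argument rests on the shape of the forgetting curve, and a few lines of Python can show how that shape alone yields a recency effect. The particular function below, strength = (1 + t) raised to a negative power, is an assumption chosen for illustration because it falls rapidly at first and then more slowly; it is not presented as Wickelgren's (1973) own equation, and the exponent and the schedule of one time unit per item are likewise hypothetical.

```python
def trace_strength(elapsed, decay_exponent=0.5):
    """Strength of a single memory trace after `elapsed` time units.

    Assumed form: (1 + elapsed) ** -decay_exponent, picked only because it
    drops quickly at first and then ever more slowly.
    """
    return (1.0 + elapsed) ** -decay_exponent


def strengths_at_test(list_length, decay_exponent=0.5):
    """Strength of each list item at an immediate test.

    One time unit passes per item, so the last item was presented 0 units
    ago and the first item list_length - 1 units ago.
    """
    return [
        trace_strength(list_length - 1 - position, decay_exponent)
        for position in range(list_length)
    ]


strengths = strengths_at_test(list_length=10)
for position, strength in enumerate(strengths, start=1):
    print(f"position {position:2d}: strength {strength:.2f}")
```

With these assumed values the difference in strength between the last two items is much larger than the difference between the first two, so the end of the list stands out without any appeal to a separate short-term store. The sketch says nothing about primacy, which the text attributes to rehearsal.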

NEGATIVE RECENCY. Wickelgren’s interpretation of the recency effect is quite 
damaging to the notion that two memory stores are necessary to account for 
recency. Nevertheless, it is instructive to pursue the controversy a step further. 
The general strategy followed by those who believe in a two-memory interpretation
of the recency effect has been to try to find variables that affect only one
portion of the serial position curve. It is assumed that, if a variable affects only the 
end of the curve, it is affecting short-term but not long-term memory. Similarly, if 
a variable affects only the beginning and intermediate positions or all positions 
equally, it is assumed that it is affecting long-term memory and not short-term 
memory (see Watkins, 1972; Whitten, 1978). 

Figure 10.4 Serial position curves for immediate (o) and final (•) recall. (Adapted from
Craik, F. I. M. The fate of primary memory items in free recall. Journal of Verbal
Learning and Verbal Behavior, 1970, 9, 143-148. Fig. 2, p. 145.)

For our purposes we shall deal with one particular variable, which has received
considerable attention. Craik (1970) presented 10 successive lists to his subjects.
They were tested for recall immediately after the presentation of each list. This
immediate-recall test produced the expected kind of serial-position effect, with a
pronounced recency effect (see upper curve in Figure 10.4). But then Craik did
something a little different. After all 10 lists had been presented and tested, he 
asked the subjects to recall as many of the items as they could from the lists. The 
results of this final recall test were quite striking (see the lower curve in Figure 
10.4). The recency effect disappeared. In fact, recall of the final items in the list
dropped below recall of items in earlier positions. This finding has been termed 
the negative recency effect. 

The negative recency effect has received a good deal of attention (Bartlett & 
Tulving, 1974; Cohen, 1970; Jacoby & Bartz, 1972). Many have looked on it as 
support for the distinction between long-term and short-term memory. The idea is 
that the end items can be recalled well in an immediate test because they are still 
in short-term memory. But in the final test the end items are not well recalled 
because they are no longer in short-term memory. They have either been lost or 
transferred to long-term memory with a relatively low probability of recovery. It 
is assumed that final recall of the end items drops below recall of earlier items 
because the end items have not been rehearsed as much as earlier items and are 
thus less likely to have been transferred to long-term memory. 

But Wickelgren (1973) attacks this interpretation, too. He argues that the 
negative recency effect can also be handled by a single-memory formulation. He 
argues that the recency effect disappears in the final recall test because the traces 
of the end items have undergone the kind of rapid loss, or initial decay, that we 
described above. His explanation of the fact that end-item recall actually drops 
below the recall of earlier items is identical to the reasoning proposed by the two- 
memory proponents. That is, earlier items are rehearsed more, learned better, and 
recalled more easily. According to Wickelgren, there is no need to postulate two 
memory systems. Thus, the evidence for two memory systems drawn from serial 
situations is, at best, equivocal.

Clinical Evidence

Another line of potential support for the long-term versus short-term distinction comes
from studies of human patients who have suffered bilateral damage to, or surgical 
ablation of, the hippocampus. Many of the surgical treatments were undertaken 
in an effort to relieve severe epileptic conditions. If you are interested in the exact 
procedures and the nature of the damage to the brain, good accounts can be 
found in Baddeley and Warrington (1970), Drachman and Arbit (1966), Milner 
(1966), Talland (1965), and Warrington and Weiskrantz (1968, 1970).

What concerns us is the peculiar effect these events, either accidental or 
surgical, have on the patients’ memory: 

1. The ability to retain new information for very short intervals appears to be 
normal. 

2. Memory for events that occurred before the brain damage appears to be 
normal. 

3. The patients are unable to retain any new information for long periods. 

In other words, although old long-term memories and the entire short-term 
memory system appear to be intact, the subjects display an almost complete 
inability to form new long-term memory traces. It seems that the patients’ ability 
to transfer information from short-term memory to long-term memory has been 
destroyed. Information already in long-term memory appears to remain intact 
and can be retrieved, but no new information can be entered into long-term 
memory. Such patients can carry on conversations (through the use of short-term 
memory). But they cannot remember a person they met and talked with a few 
minutes before. (Information has been lost from short-term memory without 
having been transferred into long-term memory.) They know who they are and 
what their life was like before the damage (intact old long-term memory). But 
they have no lasting idea of what has happened to them or the world since their 
brain trauma (no new long-term memory). Imagine what this condition must be 
like. The patients have no idea of what has happened since the trauma. In fact, 
they are even unaware that any time has passed at all since the trauma. For them, 
regardless of how long they have been there, they just came to the hospital 
“yesterday” (Hilgard & Bower, 1975). Nothing has happened to them (that they 
can recall) since they came to the hospital. 

This disability, although unfortunate for the individuals involved, has been 
taken as evidence for the existence of two memory systems. Is it possible to think 
of these results in terms of a single memory store? Rather than arguing that the 
brain trauma somehow destroyed the subjects’ ability to transfer information 
from a distinct short-term memory to a distinct long-term memory, could we not
argue that the trauma merely limits how well an item can be learned? Materials 
can be learned to the point at which they can be remembered for a short interval 
of time, but they cannot be learned to the point at which they will be remembered 
for very long. If we think of a single memory trace that can increase in strength, 
then these results can be thought of in terms of a limit having been put on the 
strength that this memory trace can attain. 

Another problem with trying to use these data as evidence for two memory 
stores is that sometimes the capacity to store new information in LTS is, although
severely retarded, not entirely eliminated. For example, Lewis (1979) reports the
case of H. M. In spite of losing almost all ability to store new information in LTS,
H. M. did know that he had some form of memory impairment. He could retain 
some visual and tactile maze learning. He was able to learn a simple factory job. 
He retained, permanently, the information that President John F. Kennedy had 
been assassinated. In other words, the loss of the ability to move information into 
LTS is not, after all, as clear and distinct as the proponents of the two-memory 
position would need it to be to convince everyone of the validity of their position. 

Who knows enough about what ablation and damage do to the brain to 
conclude that one or another interpretation is correct? It would seem, whatever the 
final outcome concerning long-term versus short-term memory, that this type of 
evidence is difficult to accept as conclusive. There are just too many unknowns in 
the situation and too many alternative explanations to be certain that we have 
clearly distinguished between long-term memory and short-term memory. For 
further clinical evidence and argument you are referred to Warrington and 
Shallice (1969) and Shallice and Warrington (1970). 

The distinction between long-term memory and short-term memory will be
with us for a long time. It has now marbled much of our thinking about memory, 
but whether it is a valid distinction remains to be seen. We have examined four 
areas of study that have been proposed as providing evidence for the distinction. 
We have seen that none of them provides conclusive evidence for the distinction. 
We have not exhausted the relevant areas of research (Wickelgren, 1973, and 
Lewis, 1979, provide fuller reviews). But the sample we have considered suggests 
that, at the very least, the distinction between long-term and short-term memory 
should be approached with caution (see Bernbach, 1969, 1975; Craik & Lockhart,
1972; Melton, 1963; Murdock, 1974).

RECALL VERSUS RECOGNITION 

We now move to the second major issue to be considered in this chapter. Is 
recognition memory fundamentally different from recall memory? When we are 
asked to recognize something, such as the correct answer to a question on a 
multiple-choice exam, are we doing something fundamentally different from 
what we do when we try to recall the answer to the same question? On an 
operational level, recall and recognition are clearly distinct. In a recall test we ask, 
“What is the item?” and in a recognition task we ask, “Is this the item?” 
(Underwood, 1972). We wish to find out whether this clear-cut operational 
distinction reflects any important underlying differences in the memory processes. 

Before we address that question, let us be sure we understand the difference 
between recall and recognition tasks. In a typical recall task, the subjects might be 
presented with a list of words and then asked to recall as many of them as they 
can. In a recognition task the subjects might be presented with the same list. But 
then during the test phase they would be presented with pairs of words. One
member of the pair would be an “old” item, or one that they had already seen. 
The second member of each pair would be a new item, or a lure. The subjects 
would be required to indicate which of the two items in each pair was the “old” 
item (Shepard, 1967). This is a typical recognition task, but there are many other 
varieties (see Hall, 1983; Kausler, 1974). The essence of all the recognition tasks is 
that the subjects must be able to recognize an item as having been observed before 
in a certain context (for example, in the list of presented items).

Recognition is generally superior to recall (but not always; see Watkins &
Tulving, 1975). Very early in the study of memory it was discovered that we are 
able to recognize more than we can recall (Hollingworth, 1913; McDougall, 
1904; Myers, 1914). In a typical study the subjects are presented with a list of 
syllables, words, pictures, or colors. They are then tested for recall and recognition.
Subjects are typically able to recognize many more stimuli than they can
recall. For example, sit down and try to list as many American Indian tribes or 
groups as you can. After a while it will seem as though you have exhausted your 
store of information. But you have not. You will be able to recognize many more, 
even though you could not recall them (for example, you may not have recalled, 
but may recognize, Choctaw, Cree, Pueblo, Mohawk, Blackfoot, Eskimo, Yurok, 
Seminole, Cheyenne, Ojibwa, Chippewa, Mohegan, Seneca, Ute, Zuni, and 
Arapaho). 

Sometimes the gap between recall and recognition is extremely wide. For 
example, Standing (1973) has shown that thousands of pictures shown to subjects 
can be recognized later with apparent ease. The title of his article is, in fact, 
“Learning 10,000 Pictures.” But when we turn to our ability to recall pictures, 
rather than merely recognize them, we find a different story entirely. Nickerson 
and Adams (1979) gave subjects empty circles and asked them to draw in U.S. 
pennies. As can be seen in Figure 10.5, the subjects were not very accurate at all. 
Something like 95% of them either omitted or incorrectly located the word 
liberty. Fifty percent of the subjects located Lincoln’s head incorrectly. In other 
words, although we do seem to be able to recognize thousands of pictures, our 
ability to recall even such a simple, common object as a penny is severely limited. 
A gap like this between recall and recognition certainly deserves our attention (see 
also Rubin & Kontis, 1983). 

Hypothesis 1: Unitary Strength 

Does this difference between recall and recognition reflect significant differences 
in memory processes? There are three major interpretations of the situation. The 
first, usually labeled the unitary-strength interpretation, maintains that the difference
is not particularly important. It has been assumed that the two tasks merely
differ in terms of their sensitivity. Memorized items are assumed to differ in 
strength; the recognition test picks up, or reveals, weaker items than does the 
recall test. Some items seem to be strong enough to be recognized but not quite 
strong enough to be recalled. According to this traditional view, the difference 
between recognition and recall does not imply anything at all about underlying 
process differences. Memory is viewed as a unitary process in which items vary 
from weak to strong. The threshold for recognition is lower than it is for recall. 
Although this view is a traditional one and some psychologists feel it is outmoded, 
it does have its modern adherents (see Rabinowitz & Mandler, 1977; Tulving & 
Thomson, 1971). 
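
The unitary-strength view can be pictured as a single strength scale with two cutoffs. The Python sketch below is a hypothetical illustration: the strength values, the two threshold values, and the function names are invented here, and the only feature taken from the text is that the threshold for recognition lies below the threshold for recall.

```python
RECOGNITION_THRESHOLD = 0.3  # assumed value
RECALL_THRESHOLD = 0.6       # assumed value, higher than the recognition threshold


def recognized(strengths, threshold=RECOGNITION_THRESHOLD):
    """Items whose strength reaches the (lower) recognition threshold."""
    return {item for item, strength in strengths.items() if strength >= threshold}


def recalled(strengths, threshold=RECALL_THRESHOLD):
    """Items whose strength reaches the (higher) recall threshold."""
    return {item for item, strength in strengths.items() if strength >= threshold}


# Hypothetical memory strengths on an arbitrary 0-to-1 scale.
item_strengths = {
    "Cree": 0.80,
    "Ute": 0.70,
    "Yurok": 0.40,
    "Ojibwa": 0.35,
    "Zuni": 0.10,
}

print("recalled:  ", sorted(recalled(item_strengths)))
print("recognized:", sorted(recognized(item_strengths)))
```

Because both tests read the same strength scale, every item that can be recalled in this sketch can also be recognized, while some weaker items are recognized only. That is the sense in which the two tests are said to differ in sensitivity and nothing more.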


Figure 10.5 Errors produced during recall of a penny: some examples of drawings, with
errors coded as omission, incorrect location, incorrect location and wrong side, or
wrong side. (Adapted from Nickerson, R. S., and Adams, M. J. Long-term memory for a
common object. Cognitive Psychology, 1979, 11, 287-307. Figs. 1 and 2, p. 289.)

Hypothesis 2: Generation-Recognition

Suppose that we present subjects with a list of 500 words. Half of the words are
common (for example, child, office, supply). Half are uncommon (for example,
ferule, julep, wattled). Then we test for recognition of these words. It has been
found that uncommon words are more easily recognized than common words
(McCormack & Swenson, 1972; Shepard, 1967). If recall and recognition involve
similar processes, we would expect the same sort of effect to appear when we 
tested for recall of common and uncommon words. But just the opposite effect 
has appeared. Common words appear to be better recalled than uncommon 
words (Kinsbourne & George, 1974; Murdock, 1960; Sumby, 1963). The same 
variable (common versus uncommon) thus has opposite effects on recall and 
recognition. This disparity has led many investigators to suggest that recognition 
and recall may well involve different underlying processes (see Bower & Bostrom, 
1968; Kintsch, 1970). 

Deciding exactly what these differences are is another matter. One common 
interpretation has been that recall involves retrieval processes, whereas recognition
does not. In a sense recall is seen as involving all of the processes of recognition,
and then some. Recall and recognition are often viewed as being different
not so much in the nature of the underlying processes but in terms of the number 
of processes involved. In order to recognize a bit of information, on the one hand, 
the subjects must have somehow encoded and stored it, but they need not search 
for it during the recognition test. To recall an item, on the other hand, the subjects 
must encode, store, and retrieve. 

Putting it another way, subjects asked to recall an item must first generate that 
item, or dredge it (and presumably many others) up out of storage, and then 
recognize it as the correct item. When subjects are asked to recognize an item, the 
generating is done by the experimenter, and the subjects need only recognize the 
provided item. 
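
The two-step account just described can be sketched in Python. Everything named in the sketch is hypothetical: the StoredItem record, its two yes-or-no flags, and the sample words and values are invented for illustration, and reducing generation and recognition to simple flags is an assumption made only to show how the steps combine.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class StoredItem:
    word: str
    can_generate: bool  # assumed: the subject can retrieve the word unaided
    seems_old: bool     # assumed: the word is accepted as "old" once in view


def recognition_test(item):
    """The experimenter supplies the word, so only the recognition check runs."""
    return item.seems_old


def recall_test(item):
    """The subject must first generate the word and then recognize it."""
    return item.can_generate and recognition_test(item)


# Hypothetical study list; the flags are invented for illustration.
study_list = [
    StoredItem("child", can_generate=True, seems_old=True),
    StoredItem("office", can_generate=True, seems_old=False),
    StoredItem("julep", can_generate=False, seems_old=True),
    StoredItem("ferule", can_generate=False, seems_old=False),
]

recognized_words = [item.word for item in study_list if recognition_test(item)]
recalled_words = [item.word for item in study_list if recall_test(item)]
print("recognized:", recognized_words)  # ['child', 'julep']
print("recalled:  ", recalled_words)    # ['child']
```

Because recall in this sketch requires passing the recognition check plus one further step, a recalled word is always a recognizable word. The sketch therefore cannot produce a word that is recalled but not recognized.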


Hypothesis 3: Control Processes 

Another way to think about the differences between recall and recognition is in 
terms of variations in control processes. If you will recall, Atkinson and Shiffrin 
(1971) refer to the fact that we use the fixed structural properties of the memory 
system in many different ways. Differences between recall and recognition might 
be traced to differences in control processes rather than to fundamentally different
memory systems. In attempting to memorize materials for future recall,
individuals may use the fixed structural properties of the memory system in one 
way. In attempting to memorize for future recognition, they may change their 
rehearsal strategy and utilize the fixed properties of the system in another way 
(see Griffith, 1975). 

Subjects do, in fact, code material differently, depending on whether they 
expect a recall test or a recognition test (Leonard & Whitten, 1983). For example, 
Carey and Lockhart (1973) had subjects learn a list with the expectation that they 
would be tested for recall. In a surprise move the subjects were actually tested for 
recognition. Their recognition performance was much poorer when they had been 
expecting a recall test than when they had been expecting a recognition test. Of 
course, the situation would become complicated if the individuals were unaware 
of the nature of the upcoming retention test (recall or recognition). In this case the 
subjects’ strategy would be to maximize their chances of success in the light of 
their best guess about the nature of the upcoming test. 

As a further example of the ways in which we might differ in our preparation 
for recall and recognition tests, consider what we might do with an item such as
XKLMJG. If we are to try to recall XKLMJG, we must be sure the entire item is
encoded. But if we encode the item for recognition, we might attend to some
portion of the stimulus rather than to the whole thing (for example, encoding the
fact that an item with L and M in the middle is an “old” item).

Pursuing this notion that recall and recognition may involve differences in 
control processes, we may ask what kinds of difference there are between the two. 
Schmidt (1983) has provided some information concerning memory for prose. He 
found that people expecting a multiple-choice test (recognition) remembered a 
great deal of detail. In contrast, people expecting an essay test (recall) remembered
higher-order units, trends, and themes. In other words, our expectations
about how we will have to use information can have quite complex effects on 
how we process that material. 

Recognition Failure 

Thus far we have been talking as though recognition were always better than 
recall. But such is not the case. The recognition-failure phenomenon has been 
demonstrated repeatedly (see Flexser & Tulving, 1978; Jones, 1978; Muter, 1978; 
Rabinowitz, Mandler, & Barsalou, 1977; Wallace, 1978). The basic experimental 
demonstration of recognition failure goes like this: The subjects study A-B pairs. 
They are first tested for recognition of the B elements alone. Then they are tested 
for recall of B, given A as a cue. In this situation it is often found that subjects can 
recall B items that they were unable to recognize. This is a very controversial but 
robust effect, and it adds to the complexity of the distinction between recall and 
recognition. Later in this chapter, in a section on the role of contextual cues in 
memory, we shall consider the current interpretations of the recognition-failure 
effect. But for now, what needs to be pointed out is that the recognition-failure 
effect is not predicted by two of our major interpretations of the recall-recognition
difference. The unitary-strength theory does not predict it, and the generation-recognition
position does not predict it, either. Both of these theories, or
hypotheses, predict that recognition will be better than recall. Thus, we are left
with two interpretations of the recall-recognition difference that conflict with the
recognition-failure effect. Perhaps the third interpretation of the recall-recognition
difference, the one having to do with differences in control processes, will
prove to be the one that can account for the recognition-failure effect. 
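
One way to score the A-B procedure described above is to ask what proportion of the B items that were recalled had earlier gone unrecognized. The Python sketch below shows that calculation. The function name, the data layout, and the five sample outcomes are hypothetical and are not taken from any of the studies cited; they serve only to show how the tally might be made.

```python
def recognition_failure_rate(outcomes):
    """Proportion of recalled B items that were not recognized.

    `outcomes` maps each B item to a (recognized, recalled) pair of
    booleans. Returns None when no item was recalled.
    """
    recognition_of_recalled = [
        was_recognized
        for was_recognized, was_recalled in outcomes.values()
        if was_recalled
    ]
    if not recognition_of_recalled:
        return None
    failures = sum(1 for was_recognized in recognition_of_recalled if not was_recognized)
    return failures / len(recognition_of_recalled)


# Hypothetical outcomes for one subject: (recognized, recalled) per B item.
sample_outcomes = {
    "B1": (True, True),
    "B2": (False, True),   # recalled with A as a cue, but not recognized alone
    "B3": (True, True),
    "B4": (True, False),
    "B5": (True, True),
}

rate = recognition_failure_rate(sample_outcomes)
print(rate)  # 0.25
```

Any rate above zero is a case that the unitary-strength and generation-recognition hypotheses, as stated above, do not predict.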

In summary, recall and recognition do differ on an operational level. Certain 
variables do affect the two differentially. And yet there is little agreement over 
what these differences mean. We have seen that the differences have been interpreted
in terms of a unitary-strength hypothesis, a generation-recognition
hypothesis, and differences in control processes, including organizational differences.
If this all seems a bit confusing, you should understand that it is psychology
and not you that is confused. There simply is no general agreement about the 
relationship between recall and recognition. 

EPISODIC VERSUS SEMANTIC MEMORY 
Thus far we have asked whether short-term memory is different from long-term 
memory and whether recognition memory is different from recall memory. These 
distinctions exemplify the tendency to split, or divide, memory. Another example 
of this trend is found in the suggestion that different types of information will
require different memory models. Thus, we might need one sort of model to
account for our ability to remember words and another model to account for our 
ability to remember sentences or paragraphs. We might need one model to 
account for the encoding, storage, and retrieval of visual information, another to 
account for auditory information, and so on. 

Typical of this tendency to suggest that different principles will govern memory 
for different types of information is Tulving’s (1972) distinction between episodic 
and semantic memory. Episodic memory, on the one hand, is assumed to store 
information about temporally arranged or dated events and episodes. Personal 
experiences are stored in episodic memory. Semantic memory, on the other hand, 
refers to memory for the meaning of words, concepts, and facts. Information 
stored in semantic memory is not temporally ordered, or related to time. 
Examples of the kind of information stored in episodic memory would be: 

1. I remember spending the early years of my life in Utica.

2. I must be at school tomorrow before 8:30 A.M.

3. I met a very uninteresting person on the bus yesterday.

Most laboratory experiments involve episodic memory. For example, remembering
that LXG was paired with KPY in a particular laboratory experiment would
be an episodic memory. 

Examples of the kind of nontemporal information stored in semantic memory 
would be: 

1. 2 + 2 = 4. 

2. Whales nurse their young. 

3. Almost everyone has a nose. 

These bits of information are not temporally ordered—they are not remembered 
in temporal relation to other experiences. 

In describing the distinction Tulving (1972) does point out that the two systems 
often interact with each other. But he feels that they can function independently 
and that they may well differ from each other in terms of encoding, storage, and 
retrieval processes. For example, Neely and Payne (1983) report some findings 
suggesting that recognition failure, described in the preceding section, may occur 
in episodic but not semantic memory. 

The Tip-of-the-Tongue Phenomenon
We have all had the experience of feeling confident that we know a particular 
word but being unable to recall it. We know we know the word, but we can’t 
quite retrieve it. This experience has been called the tip-of-the-tongue (TOT) 
phenomenon by Brown and McNeill (1966). 

The material in Box 10.1 may help you experience TOT. Try to supply the 
correct word for each of these definitions. Sometimes you will know the word 
right away. Sometimes you will be sure you don’t know the word. But sometimes 
you will have the feeling that the word is “on the tip of your tongue.” (The correct 
words are contained in the box at the end of the chapter.)

Box 10.1 The Tip-of-the-Tongue Phenomenon

Look at each of the definitions below. Supply the appropriate word for the 
definitions, if you know it. Indicate “Don’t know” for those that you are certain 
you don’t know. Mark TOT next to those for which you are reasonably certain 
you know the word, though you can’t recall it now. For these words, supply at 
least one word that sounds similar to the target word. The answers appear later
in the text.

1. An absolute ruler, a tyrant. 

2. A stone having a cavity lined with crystals. 

3. A great circle of the earth passing through the geographic poles and any given 
point on the earth’s surface. 

4. Worthy of respect or reverence by reason of age and dignity. 

5. Shedding leaves each year, as opposed to evergreen. 

6. A person appointed to act as a substitute for another. 

7. Five offspring born at a single birth. 

8. A special quality of leadership that captures the popular imagination and 
inspires unswerving allegiance. 

9. The red coloring matter of the red blood corpuscles. 

10. Flying reptiles that were extinct at the end of the Mesozoic Era. 

11. A spring from which hot water, steam, or mud gushes out at intervals, found 
in Yellowstone National Park. 

12. The second stomach of a bird, which has thick, muscular walls. 

13. The green coloring matter found in plants. 

14. The long-haired wild ox of central Asia, often domesticated as a beast of 
burden. 

15. The art of speaking in such a way that the voice seems to come from another 
place. 

From Cognition by Margaret Matlin. Copyright © 1983 by CBS College Publishing. Reprinted by permission of Holt, 
Rinehart and Winston, CBS College Publishing. 


What does TOT have to do with the distinction between semantic and episodic 
memory? Generally speaking, TOT has been thought of as involving semantic and 
not episodic memory; it involves retrieval of materials (usually words) that are 
not temporally ordered. But more recent research (Schachter, 1983) suggests that 
TOT can be demonstrated in episodic as well as in semantic memory. Hence, even 
though the distinction between episodic and semantic memory is logically clear, it 
is not so easy to demonstrate that difference experimentally. 

Flashbulb Memories 

The distinction between episodic and semantic memory is fairly new. Whether it 
will prove to be a lasting distinction remains to be seen (see McKoon & Ratcliff, 
1979). The two types of memory have not been compared experimentally very 
many times. One reason is that the vast majority of experimental studies involve 
information of the episodic variety. Most experiments put subjects through an 
“episode,” which they have to remember. Few studies have been concerned with 
preexperimental semantic information existing in long-term store. This situation 
is changing, however. More and more efforts to study “natural” versus “laboratory” 
memory are appearing (see Wilkinson & Koestler, 1983). One kind of 
natural episodic memory that has been studied is called flashbulb memory 
(Brown & Kulik, 1977; Winograd & Killinger, 1983). Flashbulb memories involve 
a situation in which we first learned of a very surprising, arousing, or 
emotional event. For example, think about when you first learned that President 
Ronald Reagan, or John Lennon, had been shot. You will probably be able to 
recall quite a bit about some or all of the following aspects of these situations: 

1. Where were you? 

2. What was going on when you heard the news? 

3. Who gave you the news? 

4. What were your immediate feelings? 

5. What did those around you feel? 

6. What happened immediately after you received the news? 

Your episodic memory for this event will be quite detailed when compared with 
your memory for less surprising, less emotional events. For example, what do you 
remember about the situation in which you first learned that Reagan was going to 
run for a second term as president? Probably nothing at all. Surprising, emotional 
events tend to be elaborated more heavily than ordinary events, and they tend to 
be tied to the temporal aspect of the situation. Flashbulb memories can best be 
described as extreme episodic memories. 

Mixed Memories 

One of the basic problems with the semantic—episodic distinction is that many 
memories may involve episodic, semantic, or both episodic and semantic aspects, 
and we are not very good at demonstrating which. For example, we know that the 
naming of a flashed word is facilitated by a single prior presentation of that word 
in a long list of words. But as Feustel, Shiffrin, and Salasoo (1983) point out, we 
don’t really know whether this facilitation involves semantic memory, episodic 
memory, or both. What was stored during that first exposure—memory for the 
event, memory of the word, or both? At this point we simply don’t know. 

Studying “natural” versus “laboratory” memory doesn’t solve all our problems, 
either. Our memory for “real life” events is probably the result of some 
complex process involving repeated attempts to recall, rehearse, and elaborate the 
stored information (see Wilkinson & Koestler, 1983). For example, your memories 
of your childhood are probably the result of many conversations (rehearsals) 
you have had with your parents and others through the years. Any given memory 
you have may be a complex mix of semantic and episodic elements. 

In any case, the distinction between episodic and semantic memory does 
provide us with an example of the tendency to propose different models for 
different types of information (see Dooling & Christiaansen, 1977; Light, Kimble, 
& Pellegrino, 1975; Underwood, Boruch, & Malmi, 1978; Watkins & Tulving, 
1975). 

ANIMAL VERSUS HUMAN MEMORY 

Let us return to a familiar question: Do we need one theory or set of theories to 
account for human behavior and another to account for animal behavior? More 
specifically, do animals and humans forget in essentially the same manner, or are 
they such different creatures that we will be forced to develop distinct models for 
each? 

We have seen that a large number of species have been used in the study of 
learning and memory. Rats, cats, monkeys, guinea pigs, pigeons, Planaria, and 
even the octopus have all been used in the laboratory. We can only assume that 
the investigators using these animals wanted to discover laws and principles that 
would eventually help them understand human behavior. 

Until recently, memory per se did not receive a great deal of attention in the 
field of animal behavior. But the recent upsurge of interest in human memory has 
been paralleled by a similar swell of interest in animal memory (Honig & James, 
1971). Demonstrations of animal-memory phenomena have ranged all the way 
from bowed serial-position curves (Wagner & Pfautz, 1978) and memory for 
discrimination (Smith & Spear, 1979) to demonstrations of PI and RI in classical 
conditioning situations (Wickens, Tuber, Nield, & Wickens, 1977) and observational 
learning in rats (Denny, Bell, & Clos, 1983). 

The position taken in this text is that, in spite of an enormous gap between the 
animal and human literatures, there are important points of contact. These points 
of contact will be emphasized. You should be aware that this is a biased position. 
Many authors argue that theories of animal and human memory will have to be 
different (see Winograd, 1971) and that a single, unified theory of memory 
encompassing animal and human behavior is highly unlikely (Grant, 1964). 

If you think about it for a moment, you will realize that we began to discuss 
similarities between animal and human memory systems in Chapter 8, where 
consolidation theory was outlined. Consolidation theory, although based for the 
most part on animal experimentation, is a system designed to describe memory 
events not only for animal populations but also for humans. There are many 
other points of contact between animal and human memory. For example, in 
Chapter 12 we shall learn how animals solve problems and form concepts in ways 
that parallel the methods used by humans. And in Chapter 13 we shall discuss the 
fascinating controversy surrounding apes and language. 

For now, we want to consider four major areas of contact between animal and 
human memory—interference effects, short-term memory, spatial memory, and 
memory for serial patterns. 

Interference and Animal Memory 

If we want to argue that animal and human memory processes are similar, and we 
know (see Chapter 8) that interference operates in human memory, then interference 
should also operate in animal memory. Do animals suffer losses in retention 
that can be attributed to the effects of prior learning (PI) and subsequent learning 
(RI)? The answer appears to be yes. Poor retention of an acquired response 
system in animals does appear to be related to previous and subsequent learning. 
As Spear (1971) puts it, interference phenomena such as PI and RI are common 
types of retention lapse in animals, although they are often not recognized as 
such. A number of studies specifically designed to demonstrate PI and RI in 
animals have been successful (Chizar & Spear, 1968; Crowder, 1967; Gleitman & 
Jung, 1963; Koppenaal & Jagoda, 1968; Maier, Allaway, & Gleitman, 1967; 
Roberts & Grant, 1978; Spear, 1973; Zentall & Hogan, 1977). It is difficult to 
conclude with any certainty that the PI and RI effects in animal and human 
designs are strictly comparable (see Gleitman, 1971; Revusky, 1971). But the 
rough correspondence between animal and human experiments and results does 
suggest that we must take seriously the possibility of similar cross-species interference 
effects. 

When considering interference phenomena and the predominantly human verbal 
research associated with them, we must keep in mind that many interference 
concepts are found in the early animal literature. We have already seen that 
the unlearning factor in the two-factor theory of interference has been viewed as 
analogous to the experimental extinction phenomenon. In addition, demonstrations 
of spontaneous recovery in human verbal behavior (for example, Ceraso & 
Henderson, 1965) provide another point of contact between human learning and 
animal conditioning. 

How one interprets these points of contact is another matter. Some authors (for 
example, Winograd, 1971) argue that, although some interference concepts have 
their roots in the animal laboratory, the meanings of the concepts change as the 
concepts are applied to human situations. The labels may be the same (extinction, 
spontaneous recovery, and so on), but the processes to which they are applied 
differ in the animal and human verbal situations. Many other authors feel, 
however, that interference effects are pervasive in the animal kingdom. For them, 
the traditional emphasis on human research in the area of memory does not 
necessarily imply that human and animal memory systems are qualitatively 
different. 


DMTS and Animal STS 

Investigators in the field of animal memory have developed a technique to study 
STS in animals. As we know, the Peterson—Peterson distractor technique is one of 
the major paradigms in the study of human STS. So as we review this new animal 
paradigm, which is known as the delayed-matching-to-sample procedure, or 
DMTS, we should ask ourselves how comparable the two techniques are. Can we 
compare data obtained from animals in DMTS situations with human data 
gathered in distractor situations? The best answer is that we should be very 
cautious about making too much of the similarities between the animal and 
human procedures, but we can proceed hopefully. 

The basic DMTS procedure is this: An animal (for example, a pigeon) is shown 
a stimulus such as the one in the left side of Figure 10.6. Then a short delay, or 
retention interval, is introduced. Then the two stimuli on the right side of Figure 
10.6 are presented. If the animal responds to the “correct” stimulus, the one it had 
seen previously, it is rewarded. 

The last 10 years have witnessed an enormous growth in the use of this 
paradigm (see Grant, Brewster & Stierhoff, 1983; Parkinson & Medin, 1983; 
Roitblat & Scopatz, 1983; Spetch & Wilkie, 1983). Correct DMTS performance 
drops off very rapidly and steeply as the time between the two stimulus presentations 
increases (Grant, 1976). If the animal in a DMTS situation is delayed for as 
little as a few seconds, correct matching deteriorates rapidly. As we learned in 








Figure 10.6 The DMTS procedure (panels: first exposure, second exposure). The 
animal is first exposed to a particular stimulus configuration (vertical lines). Then, 
after a delay, the animal is exposed to the old stimulus paired with a new stimulus, 
and choice behavior is observed. 

Chapter 9, this is also what appears in studies of human short-term memory; 
performance drops off rapidly in a few seconds. Similarly, just as interference 
effects have been found in human STS, so, too, do they appear in animal DMTS 
experiments. Both RI (Roberts & Grant, 1978) and PI (Grant, 1975; Zentall & 
Hogan, 1977) have been demonstrated, suggesting that animal and human short-term 
memory processes are similar. 

There are some obvious differences, however, between DMTS and the Peterson 
and Peterson method. In the human situation, for instance, the retention interval 
is always filled with a distractor task, such as counting backward by 3s. In the 
DMTS situation the animal is often left alone. And the Peterson and Peterson 
experiment is a recall situation, whereas DMTS involves recognition. These 
differences need not lead us to conclude that the effects obtained in the two 
situations are totally incomparable. But they do suggest that we be cautious. 

Spatial Memory 

What else is there about animal performance that leads us to suspect that humans 
are not all that different from animals in their memory processes? One area of 
research that is often mentioned has to do with the rat’s behavior in a radial maze 
(see Figure 10.7). 

It is obvious that humans can remember and utilize information about the 
spatial configuration of their environment. For example, when you search for a 
clean pair of socks in the morning, you tend to look in every reasonable spot 
without repeating yourself. You don’t look in the same drawer twice (at least not 
until you have exhausted other possibilities), because you can remember that you 
just looked there. Our spatial memory is good but not perfect. For example, we 
tend to remember a given distance with many turns as being longer than the same 
distance with fewer turns (Moar & Bower, 1983). But can animals engage in 
similar behavior? It would be to their advantage in terms of such things as food 
gathering if they could. If a hummingbird feeds on the nectar of a certain flower, 
it would be advantageous if the bird could remember not to visit that particular 
flower again for a while until the plant had had time to replenish the nectar 
supply. 
Figure 10.7 A radial maze. (Adapted from Olton, D. S., and Samuelson, R. J. 
Remembrance of places passed: Spatial memory in rats. Journal of Experimental 
Psychology, 1976, 2, 97—116. Copyright 1976 by the American Psychological 
Association. Reprinted by permission.) 

This kind of beneficial memory for location in space has been demonstrated 
using rats in a radial maze like that in Figure 10.7 (Olton, 1978; Olton & 
Samuelson, 1976). The maze had eight arms radiating out from a central area. A 
piece of food was put in the end of each arm. The rat was placed in the central 
area and allowed to choose and go into any arm. Once it ate the food in a 
particular arm, that arm remained without food, even though the rat could go 
back into that arm at any time. 

The question was what the hungry rat would do given the opportunity to freely 
roam this maze. Quite simply, the rat very quickly learned to enter only arms that 
it had not previously entered. If we define a correct choice as entering a previously 
unentered arm, we find that within a few trials the mean number of correct 
choices was above seven. The rat almost never entered a previously visited arm, 
thereby rendering its food gathering very efficient. 
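
The scoring rule just described can be made concrete with a short sketch. The choice sequence below is hypothetical, invented only to illustrate the rule. The chance figure rests on an assumption that does not come from the original studies: that a rat with no memory picks each of its eight choices independently and with equal probability among the eight arms.

```python
def count_correct_choices(entries):
    """Count entries into arms that had not been entered earlier in the trial."""
    visited = set()
    correct = 0
    for arm in entries:
        if arm not in visited:
            correct += 1
            visited.add(arm)
    return correct


def expected_correct_by_chance(n_arms=8, n_choices=8):
    """Expected number of distinct arms entered under the ASSUMED chance model:
    each choice is independent and uniform over all arms."""
    return n_arms * (1 - (1 - 1 / n_arms) ** n_choices)


# Hypothetical sequence of eight arm entries (arm 6 is re-entered once).
hypothetical_trial = [3, 6, 1, 8, 4, 6, 2, 7]

print(count_correct_choices(hypothetical_trial))   # 7
print(round(expected_correct_by_chance(), 2))      # 5.25
```

Under this simple chance model a memoryless animal would average only about five correct choices out of eight, which is why a mean above seven is taken as evidence of memory rather than luck.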

This behavior seems to be consistent with the idea that the rat has a good 
memory for spatial locations. But there are some alternative interpretations that 
must be considered. For example, it is possible that the rats were “marking” the 
arms they visited with, say, a drop of urine. In this way they could use odor as a 
cue not to revisit a particular arm. This possibility has been discounted by several 
kinds of experiments. Olton and Samuelson (1976) soaked the maze with aftershave 
lotion to hide any possible odor trails the rats might be leaving. And 
Zoladek and Roberts (1978) surgically disrupted the rats’ ability to smell. In both 
these studies the rats still performed well in the maze, suggesting that it is spatial 
memory and not odor that accounts for the effects. 

Another possibility is that the rats are merely learning a chain of responses, 
such as always going into the next arm to the right. This interpretation is contradicted 
by the fact that the rats do not always make their choices in some fixed sequence, 
such as moving counterclockwise around the maze. Although there is some 
regularity in their choice patterning, there is not enough of it to account for their 
performance. These and other studies (see Beatty & Shavalia, 1980) lead us to the 
conclusion that rats do possess spatial memory and that, in this sense, their 
memory apparatus is similar to that of the human. 

Serial-Pattern Learning 

It’s pretty obvious that many of the tasks we deal with in everyday life involve 
serial patterns. We often have to learn “what follows what.” For example, written 
and spoken language involve complex serial patterns. Similarly, music involves 
the organization of successive sounds. Humans are very good at analyzing, 
perceiving, and remembering serial patterns. For example, consider the sequence: 

1234234534564567_ 

What goes in the blanks? The answer (5678) is even more obvious when we group 
the numbers as: 


1234 2345 3456 4567_ 
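
The rule behind this pattern can be written out explicitly. The sketch below is only one way of stating it, and the function names are illustrative. It assumes that the sequence is read in groups of four digits and that each group is formed by adding one to every digit of the group before it; the rule is not meant to work once a digit would pass 9.

```python
def split_groups(sequence, size=4):
    """Break a digit string into consecutive groups of `size` digits."""
    return [sequence[i:i + size] for i in range(0, len(sequence), size)]


def next_group(groups):
    """Apply the assumed rule: add one to each digit of the last group."""
    last = groups[-1]
    return "".join(str(int(digit) + 1) for digit in last)


groups = split_groups("1234234534564567")
print(groups)              # ['1234', '2345', '3456', '4567']
print(next_group(groups))  # 5678
```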

The question we want to address here is whether animals also respond to, and 
learn about, the serial patterns inherent in the stimuli they perceive. Hulse (1978; 
Hulse & Dorsky, 1979) put the question to experimental test. Two groups of 
hungry rats were allowed to run down a runway to a goal box, where they 
received different numbers of food pellets. One group of animals received 14 
pellets on the first run, 7 pellets on the second run, 3 on the third, then 1, and 
finally 0 on the fifth run. Thus, this group experienced a 14-7-3-1-0 sequence, 
with each run producing a smaller reward than the preceding run. This was called 
the monotonic group. The second, or nonmonotonic, group received rewards in a 
14-1-3-7-0 sequence. The assumption was that, for a rat, the 14-7-3-1-0 sequence 
would be much simpler than the 14-1-3-7-0 sequence. 
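
The difference between the two reward sequences can be checked mechanically. The following sketch simply tests whether every reward is smaller than the one before it. The function name is illustrative and is not taken from Hulse's work.

```python
def is_monotonic_decreasing(rewards):
    """Return True if each reward is strictly smaller than the preceding one."""
    return all(earlier > later for earlier, later in zip(rewards, rewards[1:]))


monotonic_sequence = [14, 7, 3, 1, 0]
nonmonotonic_sequence = [14, 1, 3, 7, 0]

print(is_monotonic_decreasing(monotonic_sequence))     # True
print(is_monotonic_decreasing(nonmonotonic_sequence))  # False
```

A single rule ("each run pays less than the last") describes the first sequence, whereas the second sequence rises after its second run and so cannot be summarized by that rule.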

As both groups gained more experience through repeated runs through their 
assigned sequence, it was assumed that they would learn about the serial patterning 
of the rewards. Specifically, it was predicted that both groups would learn to 
run fast on the first trial of each sequence, when both were rewarded with 14 
pellets. It was also assumed that they would both learn to slow down on the fifth 
run of each series, because no reward was given on this run. Figure 10.8 shows 
that this is what happened; both groups ran fast on the 14-pellet run and slow on 
the 0-pellet run. 
Figure 10.8 Asymptotic running times on five successive runs reinforced by different 
numbers of food pellets. One group of rats received a monotonic, decreasing series of 
food quantities (14-7-3-1-0 pellets). The other group received a nonmonotonic series of 
food quantities (14-1-3-7-0 pellets). (From Hulse, S. H. Cognitive structure and serial 
pattern learning by animals. In Hulse, S. H., Fowler, H., and Honig, W. K. (Eds.). 
Cognitive processes in animal behavior. Hillsdale, NJ: Erlbaum, 1978.) 

But now comes the crucial prediction. If rats really do pay attention to serial 
patterns, and the 14-7-3-1-0 sequence is simpler than the 14-1-3-7-0 pattern, then 
the monotonic group should be more likely to learn that the fifth run produced no 
food and should thus run slower than the nonmonotonic group on the 0-pellet 
run. As you can see in Figure 10.8, that was the case. These kinds of experiments 
have been taken as evidence for the idea that rats do process and utilize serial 
stimulus patterns. They may not be as good at it as humans are, but they learn 
patterns (see also Self & Gaffan, 1983). 

In summary, theories about animal and human memory have much in common. 
Many of the principles and mechanisms found in the study of human 
memory are also found in the animal literature. Whether the investigations of 
animal memory and human memory will eventually converge or diverge remains 
to be seen. At present, the best that can be said is that the two fields, although 
somewhat isolated, do seem to be feeding into and benefiting from each other. 

AUTOMATIC VERSUS CONTROLLED PROCESSING 

The distinction between automatic and controlled processing was introduced in 
Chapter 5, but it bears mentioning again because it is an important dichotomy in 
the current study of memory. Controlled processing of information is effortful, 
slow, serial, and limited in capacity. Automatic processing is faster, is parallel, 
is unlimited in capacity, and requires no attention on the part of the subject 
(see Fisk & Schneider, 1984; Reiner & Morrison, 1983; Rosinski, 
Golinkoff, & Kukish, 1975; Salmaso, Baroni, Job, & Peron, 1983; Schneider & 
Fisk, 1974; Schneider & Shiffrin, 1977). 

The ultimate usefulness of this distinction remains to be seen. Some feel it is 
important, but others doubt that the two systems can be clearly separated. 

CONTEXTUAL CUES AND STATE-DEPENDENT MEMORY 

The situation, or context, in which we learn something is very important when it 
comes to remembering what we have learned. If we learn a task in Situation A and 
are then tested for recall of that task, we will do better if we are tested in the 
original Situation A than we will if the situation is changed to B. 

There are a number of diverse experimental demonstrations of the truth of this 
generalization. For example, Smith (1979) had subjects learn some material in 
Room A and then tested for recall either in Room A or Room B. He found 
superior recall in Room A. In a second experiment he encouraged the subjects to 
try to remember and imagine Room A even though they were being tested in 
Room B. In this case he found less of a decrement in recall when subjects were 
tested in Room B. The term state-dependent memory is often used to describe 
these sorts of effects, because memory seems to be dependent on the state, or 
situation, that existed during original learning. 

State-dependent effects are not limited to rooms. Similar effects can be obtained 
with chemical states (Eich, Weingartner, Stillman, & Gillin, 1975). In these 
studies subjects learn a task either under the influence of a particular drug (for 
example, alcohol, caffeine) or drug free and are then tested either with or without 
the drug. If retention is better under unchanged conditions (that is, drug-drug or 
no drug-no drug) than under changed conditions, we say state-dependent memory 
has been demonstrated. 
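
The logic of this two-by-two design can be sketched in a few lines. The recall scores below are hypothetical numbers made up purely for illustration; they are not data from any study cited here. The decision rule, a comparison of the average of the unchanged conditions with the average of the changed conditions, is a deliberate simplification of what a real statistical analysis would do.

```python
from statistics import mean


def shows_state_dependence(scores):
    """scores maps (learning_state, test_state) to a retention score.
    Returns True if unchanged conditions average higher than changed ones."""
    unchanged = [s for (learn, test), s in scores.items() if learn == test]
    changed = [s for (learn, test), s in scores.items() if learn != test]
    return mean(unchanged) > mean(changed)


# Hypothetical mean recall scores for the four cells of the design.
hypothetical_scores = {
    ("drug", "drug"): 12,
    ("no drug", "no drug"): 14,
    ("drug", "no drug"): 8,
    ("no drug", "drug"): 9,
}

print(shows_state_dependence(hypothetical_scores))  # True
```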

Context effects are not limited to rooms or drugs, either. The verbal context in 
which a word is learned can be important, too (Baker & Santa, 1977; Smith, 
Glenberg, & Bjork, 1978; Tweedy, Lapinski, & Schvaneveldt, 1977). If a word is 
learned while embedded in a particular semantic context (for example, between 
two other words), free recall of that word will be better if the surrounding words 
are presented during recall efforts. 

Stanovich and West (1983a; 1983b) have extended the analysis of the importance 
of the verbal context by showing that the processing of the last word in a 
sentence is better if the preceding sentence context is congruous with that word. Visual contexts can be 
important, too. For example, Biederman, Glass, and Stacy (1973) had subjects 
search for particular objects either in a whole, normal picture or in that same 
picture when it had been cut into six pieces and rearranged. Search was much 
more effective in the whole condition, presumably because the context of objects 
in their proper relationships served as effective cues in the search process (see also 
Winograd & Lynn, 1979). 

State-dependent-memory effects, and the general importance of context in 
memory, have become more and more important as the study of memory grows. 
Interested readers are referred to Bartling and Thompson (1977), Bowyer and 
Humphreys (1979), Flexser and Tulving (1978), and Ley (1977) for more detailed 
accounts and analyses of context effects. 
Figure 10.9 Split-brain experiment. The corpus callosum is severed. When a stimulus is 
presented in the left field only, all messages are conducted to the right hemisphere. 
When a stimulus is presented in the right field only, all neural messages go to the left 
hemisphere. 


ONE BRAIN OR TWO? 

As a final example of the kinds of distinctions that are being made in the study of 
memory and information processing, we shall address some very intriguing suggestions 
concerning brain functions that come from what have been called “split-brain” 
experiments (Sperry, 1961). The vertebrate brain is surprisingly symmetrical. 
The structures of the left side of the brain seem to be duplicated perfectly on 
the right side. This symmetry poses some very interesting questions concerning 
the location of memory “traces.” Is all information processed equally on both 
sides of the brain? If the structures on both sides of the brain are identical, is the 
information processed within them identical? 

Interestingly, the answer seems to be no. Gazzaniga (1970, 1972) and Gazzaniga 
and Sperry (1967) investigated a number of humans whose two halves of the 
brain had been separated surgically. Very simply, the two halves, or hemispheres, 
of the brain are normally connected, or bridged, by an extensive bundle of fibers 
known as the corpus callosum. In order to alleviate severe epilepsy, some patients 
undergo an operation in which almost all of their corpus callosum is severed. In 
other words, the two halves of the brain are essentially “disconnected” as in 
Figure 10.9. As it turns out, it is possible to deal with these disconnected halves 
Answers to Definitions in Box 10.1 

1. despot 
2. geode 
3. meridian 
4. venerable 
5. deciduous 
6. surrogate 
7. quintuplets 
8. charisma 
9. hemoglobin 
10. pterodactyl 
11. geyser 
12. gizzard 
13. chlorophyll 
14. yak 
15. ventriloquism 

independently. Entry into one or the other half can be accomplished by restricting 
visual stimuli to one half of the visual field. A stimulus flashed to the left half of 
the visual field of either eye will be relayed to the right side of the brain. A visual 
stimulus flashed to the right side of the visual field of either eye will be relayed to 
the left side of the brain. By restricting input to half of the visual field investigators 
can be sure that they are dealing with but half of the brain in a “split-brain” 
patient (see Figure 10.9). 

In testing split-brain subjects Gazzaniga and his colleagues found that the left 
half of the brain tends to be relatively specialized for active speech events, such as 
talking and writing. The left side of the brain can read words and initiate speech. 
When a printed word is presented to the left side of the brain, the split-brain 
subjects can read it and say it aloud. The right side of the brain seems to be more 
involved with nonverbal information, such as spatial reasoning. It does not do too 
well with verbal production. For example, if a word is flashed on the left half of 
the visual field (right hemisphere of the brain), subjects appear unable to say the 
word aloud. But if they are asked to recognize the word rather than recall it, they 
can do so. The right half of the brain apparently has the ability to recognize verbal 
units but not to initiate verbal responses. The notion that the left and right halves 
of the brain are specialized for verbal and nonverbal factors, respectively, gains 
support from the work of other investigators (see Galin & Ornstein, 1972; 
Milner, 1968). 
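
The pattern of results just described can be summarized as a small lookup. This is only a toy restatement of the reported findings for split-brain patients, not a model of the brain, and the function names are illustrative. It assumes that input is restricted to one half of the visual field and that the corpus callosum is not available to pass information between the hemispheres.

```python
# Each half of the visual field projects to the opposite hemisphere.
CONTRALATERAL = {"left": "right", "right": "left"}


def receiving_hemisphere(visual_field):
    """Hemisphere that receives a stimulus flashed to the given half-field."""
    return CONTRALATERAL[visual_field]


def can_say_word_aloud(visual_field):
    """Per the findings above, only the left hemisphere initiates speech."""
    return receiving_hemisphere(visual_field) == "left"


def can_recognize_word(visual_field):
    """Per the findings above, either hemisphere can recognize a word."""
    return receiving_hemisphere(visual_field) in ("left", "right")


for field in ("left", "right"):
    print(field, receiving_hemisphere(field),
          can_say_word_aloud(field), can_recognize_word(field))
# left right False True
# right left True True
```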

This line of research does not suggest that the two hemispheres of the brain can 
handle only certain types of information, nor does it imply that the hemispheres 
do not “communicate” with each other. To the contrary, in normal individuals 
half of the brain seems capable of “taking over” the functions of the other half if 
the other has been damaged or incapacitated. In addition, the corpus callosum 
apparently provides a means of transferring information processed on one side of 
the brain into the other. But it does appear that the two sides are specialized for 
distinct functions (see also Hellige, Cox, & Litvac, 1979; Hellige & Wong, 1983; 
McKeever & Hoff, 1983; Searleman, 1977). 


SUMMARY 

1. Many divisions of memory have been proposed. This chapter considers 
several important distinctions. 

2. The first widely discussed distinction is between short-term memory and 
long-term memory. Several areas of research bear on this issue. 

3. At one time it was suggested that interference effects (PI and RI) operate in 
long-term memory but not short-term memory. This proposal has been 
disproved. Both PI and RI operate in short-term memory. 

4. Release from PI demonstrates that similarity effects operate in STS as well as 
LTS interference situations. 

5. Early research suggested that short-term memory might involve acoustic 
coding processes, whereas long-term memory might involve semantic coding 
processes. This conclusion was drawn from studies that seemed to indicate 
that subjects would confuse acoustically similar materials in short-term 
memory and confuse semantically similar materials in long-term memory. 

6. But the acoustic—semantic distinction has been rejected. It now appears that 
short-term and long-term memory are flexible in terms of coding. Depending 
on the demands of the task and the desires of the subject, short-term and 
long-term memory may involve acoustic coding, semantic coding, or both. 

7. The recency effect in the free recall of a serial list refers to the fact that the 
items toward the end of the list are recalled better than earlier ones. 

8. This recency effect has been interpreted in terms of the two-memory position. 
Items toward the end of the list are presumed to be in short-term memory at 
the time of recall and, thus, easily recalled. Earlier items are no longer in 
short-term memory (lost or transferred to long-term memory) and are less 
likely to be recalled. 

9. Wickelgren argues that the recency effect can also be interpreted in terms of a 
single-memory position. If we assume that memory traces are lost rapidly at 
first and then more slowly, there is no need to postulate two memory stores. 

10. Negative recency refers to the fact that recall of the end items will drop below 
recall of earlier items if the recall test is delayed. 

11. Negative recency has also been interpreted as evidence for two memory 
stores. But alternative interpretations are available. 

12. Patients who have suffered bilateral damage of the hippocampus have sometimes 
been unable to form new long-term memories. Old long-term memories 
remain intact, as does the entire short-term memory system. The loss of 
the ability to transfer information from short-term store to long-term store 
has been taken as evidence for the two-memory position. But alternative 
interpretations are available. 

13. Amnesic effects do not appear to be as total or permanent as the separate- 
store position would suggest. 

14. The distinction between short-term memory and long-term memory has not 
been conclusively substantiated. 

15. The second major distinction of the chapter is between recall and recognition. 
In a recall task we ask, “What is the item?” In a recognition test we ask, 
“Is this the item?” Recognition tends to be superior to recall, and sometimes 
it is far superior. 

16. The unitary-strength hypothesis holds that recognition and recall do not 
involve fundamentally different processes. Recognition is merely seen as a 
more sensitive indicator of memory traces within a single-memory framework. 

17. But some variables (for example, common versus uncommon items) affect 
recall and recognition in opposite ways. This fact has led many to suggest 
that the two are truly different. 
18. The generation-recognition hypothesis holds that recall involves encoding, 
storage, and retrieval, whereas recognition involves only encoding and 
storage. 

19. The control hypothesis argues that we differ in how we use our single-memory
system when we are encoding for recall or encoding for recognition.

20. We may organize more when coding for recall than when we code for 
recognition. 

21. Recognition failure refers to the fact that we can sometimes recall material
that we have been unable to recognize. This phenomenon is not predicted
by either the unitary-strength hypothesis or the generation-recognition 
hypothesis. 

22. Episodic memory versus semantic memory refers to the idea that one model 
may be required for the storage of temporally ordered personal experiences 
(episodic) and another for nontemporal facts (semantic). 

23. The tip-of-the-tongue phenomenon and flashbulb memories both bear upon 
the semantic-episodic distinction.

24. Many memories may have both episodic and semantic characteristics. 

25. Some have suggested that we shall need different models to account for 
human and animal memory. 

26. Yet animal and human memory systems appear to have much in common. 
For example, PI and RI have been observed in animal situations. Extinction 
and spontaneous recovery have also been implicated in human as well as 
animal situations. 

27. The delayed-matching-to-sample procedure is used to study short-term memory
in animals. Although not identical to the distractor technique, it is
similar. 

28. Spatial memory has been shown to be roughly equivalent in humans and 
animals. 

29. Both animals and humans engage in serial-pattern learning. 

30. Some information may be processed automatically and some in a controlled 
manner. 

31. If we learn a task in one situation, we will do better on a retention test if we 
are tested in the same rather than a different situation. Such state-dependent 
memory emphasizes the importance of context in memory and can be demonstrated
when the context involved is geographical, chemical, verbal, or
visual. 

32. Split-brain research suggests that the two hemispheres of the brain tend to 
process different types of information. 

INTRODUCTION

The psychology of memory is currently awash with studies of “organization in 
memory,” “structure in memory,” and “elaboration in memory.” The meaning of 
these phrases varies from investigation to investigation. At present, there is no firm 
consensus concerning the definition of organizational processes within the overall 
memory process. And yet there do seem to be common elements among the 
various uses of the phrases. Usually, these phrases can be used interchangeably. 
The purpose of this chapter is to introduce the complexities of the “organizational,”
“elaborative,” or “structural” approach to memory.

This approach is not completely distinct from alternative ways of looking at 
memory. In fact, it has much in common with other interpretations, particularly 
information-processing theory. In the structural approach, however, the emphasis 
is on organizational processes within the overall memory system. The structural 
approach is not bound up with any one theory, as is the interference approach. 
Rather, it is an orientation toward the question of memory that stresses the fact 
that we do not passively store information exactly as it is delivered to us. Instead, 
we structure, interrelate, elaborate, and organize it. We try to systematize
information so that both storage and retrieval will be efficient. As Cofer (1975) puts
it, we operate as though our memory system resembles an encyclopedia, a
dictionary, or a library.

Let us begin our discussion with an example. Consider a game that children 
play when they are trying to fight off boredom on long car trips. The first player 
begins by saying, “I went to the store and bought an apple.” The second player 
says, “I went to the store and bought an apple and a knife.” The third says, “I 
went to the store and bought an apple, a knife, and a shirt,” and so on, until 
someone makes an error in recalling the building chain of items. Each player must 
recall all the previous items and add one new one. How might you go about 
remembering the growing list of items? One technique might involve the simple 
rehearsal, or repetition, of the items over and over in the correct order. But there 
are other possibilities. Some people find it convenient to build mental “images” 
with the items. Thus, one player might imagine a green apple impaled on a golden 
knife and wrapped in a blue shirt. When recall is required, she merely summons 
up the “picture,” or image, and reads off the necessary items. These images 
(whatever they are) can vary tremendously. They may be simple or complex, 
active or passive, ordinary or unusual, and so on. This process of relating items to 
one another in a visual image is one example of an organizational, or structural, 
strategy. The player is doing more than rehearsing items. She is structuring the 
items and relating them to one another, through the use of a mental image. 

Another player might remember the same items by constructing a sentence. The 
words in the sentence might begin with the first letters of the to-be-remembered 
items. Thus, “Another king said . . .” might provide a vivid means for remembering
apple, knife, shirt. The player relates the items to one another, or organizes
the materials, by imbedding them within a sentence. Another individual might use 
a sentence construction technique of a different sort. This player might make up a 
sentence using the items themselves rather than their first letters—for example, 
“The old man picked up the apple and looked for his knife in his shirt pocket.” 

Each of these players is organizing, imagining, structuring, or relating the 
to-be-remembered items in a different manner. The number of possible organizational
schemes, or strategies, is almost unlimited. We can categorize and group
items. We can string them in series. We can relate them to one another and to 
information we bring into the situation. It is this vast array of mental activities 
that is referred to as organization in memory. 

In a simple sense organizational processes refer to whatever we do to information
between input and output. We can systematize, imagine, relate, order, select,
elaborate, and transform information. It is clear why there tends to be some
uncertainty and confusion concerning the exact nature of organization in memory.
A definition of organization would have to encompass an enormous number
of varied mental activities. We should not be dismayed by the diversity of interests 
and activities that have fallen under the rubric of organization. Rather, we should 
look on the disorganized field of organization as an interesting, stimulating 
approach to the study of memory (see Murdock, 1974). 

In attempting to characterize the evolving organizational approach we shall 
discuss several areas of research that have been subsumed by the heading. We 
shall be discussing experimental demonstrations of the individual’s ability to 
improve retention through the use of organizational processes. For a fuller review 
of organizational research you are referred to Melton and Martin (1972) and 
Reynolds and Flagg (1983). 

Before we turn to our review of specific areas of research, we should note that 
an interest in organization in memory is not new. In fact, current interest in such 
processes really represents a return to areas that have long been of interest to 
psychologists. We are witnessing a swing of the pendulum, so to speak. As 
Postman (1972) and others have pointed out, one of the basic tenets of Gestalt 
psychology was that memory is governed by principles of organization and 
perceptual grouping. In the early days methods for measuring organizational 
activities were lacking. As a result the concepts of Gestalt psychology never 
obtained firm experimental support. Lack of adequate experimental techniques 
forced psychologists to lay the questions aside. But interest in the issues raised by 
the Gestalt psychologists never completely disappeared. The past two decades 
have seen a renewal of interest in organization in memory. This explosion of 
activity was made possible by the development of some new techniques for 
measuring and quantifying the elusive organizational activities. Some of these 
new techniques and the data they have yielded will be discussed in the sections 
that follow. 

Actually, we have already begun our study of organization. For example, our 
discussion of semantic-network models in Chapter 9 represents an instance of 
concern for structure in memory. So keep in mind that the topics and issues we 
discuss here are inherent in many of the topics we have already discussed. 

EARLY WORK: WORD ASSOCIATIONS 
Some early, relatively unsophisticated evidence for the existence of organization 
in memory was revealed by the development of the word-association test (Kent & 
Rosanoff, 1910). The test consists of 100 common words. The subjects’ task is to 
“give the first word that comes to mind” in response to each of the 100 words. It 
is an interesting test, and one that most individuals do not mind at all. As it turns
out, there are some extremely common responses to many of these words. Thus,
mother often evokes father, and black elicits white with consistently high
probabilities. This is not to say that everyone gives these common responses. Some
people have come up with unique responses, to say the least (see Table 11.1). But a
great number of individuals from various groups over the years have given the
same responses (Fox, 1970; Jenkins & Russell, 1960). Mackenzie (1972) provides
a review of much of the research being done with word associations (see also
Bilodeau & Howell, 1965; Entwisle, 1966; Rubin, 1983).

Table 11.1 Word-Association Responses Given by 100 Subjects to Some Selected
Stimulus Words

STIMULUS    RESPONSE     RESPONSE
            FREQUENCY

Woman       59           Man
            10           Child, girl
             4           Sex
             2           Love, mother
             1           Friend, world, clean, thin, hair, old, soft, pretty,
                         female, baby, home, lady, fold

Chair       55           Table
            15           Sit
             4           Seat, desk
             2           Legs, couch, bed
             1           Study, home, blue, soft, foot, arm, plastic, yellow,
                         wood, rest, hard, window, floor, comfortable,
                         professor, house

Blossom     74           Flower
             7           Apple
             4           Rose
             2           Springtime
             1           Cut, pretty, bud, bouquet, Pasadena, sniff, smell,
                         blooms, red, tree, plant, sweet, orange

Unpublished data obtained by the author.

But for our purposes the important point is that these consistent patterns of 
word associations suggest that words are organized, or structured, within each of 
us. They are not merely lumped, or piled randomly, within us somewhere. They 
are related to one another in a distinct manner. In a very rough way the word- 
association test reveals some of this structure. 

It should come as no surprise to learn that this structure can change as we grow 
and develop. Petrey (1977) gave a word-association test to kindergarten children 
and to first-, third-, and fifth-graders. He found a dramatic shift in the associations
these children gave as they grew older (see Table 11.2). When they were very
young, the children tended to give episodic responses. If you will recall, episodic
memory is memory for temporally related events or episodes. The young children’s
associations showed a distinct autobiographical quality (for example, go to
bed in response to dark), whereas older children gave more semantic responses to 
the same stimuli (for example, light to dark). (A semantic response is one that has 
to do with meaning rather than episode.) 

Table 11.2 Children’s Word-Association Responses, Showing the Shift from Episodic to
Semantic Associations as the Children Grow Older

                                                      GRADE LEVEL
STIMULUS   RESPONSE                         KINDERGARTEN   1ST   3RD   5TH

Examine    Doctor, or eye                        23         46    49    27
Examine    Look at, check up, or test             1          7    51   120
Dark       Moon, stars, go to bed, or go
           to sleep                              23         25     3     1
Dark       Light                                 25         85   185   180
Long       Grass                                 16         11     7     5
Long       Short                                  9         56   177   164
Give       A present                             24         18     4     5
Give       Take                                   2          6    56    99

Adapted from Petrey, S. Word associations and the development of lexical memory.
Cognition, 1977, 5, 57-71. Table 1, p. 64. Used by permission of Elsevier Science
Publishers B. V.

The word-association test has been used in clinical situations. Some
psychotherapists believe that they can learn something about individuals by
examining their responses on a word-association test. The responses may provide
the therapist with insights into the patient’s personality or mental state. If in
response to mother the patient says father, home, or love, the therapist may be
led to one set of conclusions. But if the patient’s response to mother is blood vein,
rape, or horror, the therapist might begin to think along different lines. Though it
does not provide us with a great deal of information about mental organization,
the word-association test does represent an early, and ongoing, technique for
tapping into existing memory structures.

CHUNKING AND REWRITING: 7 ± 2 
Suppose that we are presented with the following letters, one at a time: F, L, S, M, 
P, U, J, E. We are then asked to recall them immediately in any order. We might be 
able to accomplish this feat, but we would certainly be hard pressed to handle any 
more information. But it is possible to organize this material in such a way that 
our memory would hardly be taxed at all. For example, we can rearrange the 
letters to spell JUMPS and ELF. By doing this we can remember all eight letters 
easily and have plenty of memory capacity left over. We could easily handle a lot 
more letters, especially if they could also be arranged as words. By organizing the 
information we have greatly expanded the capacity of our immediate memory. 

The concept of the memory chunk was introduced by Miller (1956). Miller 
argues that our immediate memory is limited by the number of chunks of 
information that it can handle but that the amount of information contained in 
each chunk may vary. As long as the information can be expressed as a single 
response, then it represents one chunk. Thus, JUMPS and ELF each represent one 
chunk, but each chunk contains several units of information (letters). In a sense 
the process of increasing the size of a chunk seems to involve our ability to 
characterize many different units of information with fewer and fewer responses.
Thus, instead of having to say F, L, S, M, P, U, J, E (eight responses, or eight
chunks), we say JUMPS and ELF (two responses, or two chunks). By transforming
eight responses into two responses we are freeing ourselves to handle additional
information. Immediate memory does not appear to be limited by number
of physical stimuli. Instead, it is limited by the number of chunks we can handle. 
Miller’s best guess was that we can handle seven plus or minus two chunks of 
information. Each chunk may vary in terms of the amount of information it
contains. Thus, O is one chunk, SO is one chunk, SOP is one chunk, and STOP is
one chunk. It is just as easy to remember one of these as another, even though they
vary in terms of number of letters. If we can increase the amount of information
contained in each chunk, we can greatly expand our memory capacity, even 
though it is limited to approximately seven chunks. 

Here is another example of the way in which we can increase the amount of 
information contained in each of our seven plus or minus two chunks. Suppose 
that we are presented the following binary digits and asked to recall the sequence: 
101000100111001110. See if you can read the sequence once and then recall it 
correctly. Now, as it stands, that is a rather difficult task. The sequence contains 
18 bits of information, and it probably exceeds our memory capacity. But suppose 
that before reading off the sequence we had learned the following: 0 means 00, 1 
means 01, 2 means 10, and 3 means 11. In other words, we have given new names 
to all possible sequences of two binary digits. Having learned this new language 
we are now ready to tackle the original sequence. As the sequence is being 
presented, we can recode it in terms of our new language. Thus: 

Original   10  10  00  10  01  11  00  11  10

Recoded     2   2   0   2   1   3   0   3   2

To remember the original sequence of 18 binary digits all we have to do is 
remember 220213032. This may still exceed our memory span, but it certainly 
represents an improvement. We have cut the number of units we must remember 
in half. When asked to recall the original sequence, we merely recall as much of
220213032 as we can and decode it as we go along. In other words, by organizing 
the to-be-remembered information (through rewriting) we have vastly improved 
retention. 
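
The recoding scheme just described can be sketched in a few lines of Python. This is only an illustration of the pair-by-pair renaming given in the text (0 means 00, 1 means 01, 2 means 10, 3 means 11); the function names recode and decode are hypothetical and are not taken from Miller.

```python
# Minimal sketch of the recoding example in the text.
# The mapping comes from the passage; the function names are assumptions.
CODE = {"00": "0", "01": "1", "10": "2", "11": "3"}
DECODE = {digit: pair for pair, digit in CODE.items()}


def recode(bits: str) -> str:
    """Rename each successive pair of binary digits with a single digit."""
    if len(bits) % 2 != 0:
        raise ValueError("expected an even number of binary digits")
    pairs = [bits[i:i + 2] for i in range(0, len(bits), 2)]
    return "".join(CODE[pair] for pair in pairs)


def decode(digits: str) -> str:
    """Expand each recoded digit back into its pair of binary digits."""
    return "".join(DECODE[digit] for digit in digits)


original = "101000100111001110"
recoded = recode(original)
print(recoded)                      # 220213032
print(len(original), len(recoded))  # 18 9
print(decode(recoded) == original)  # True
```

The sketch makes the point of the passage concrete: the number of units to be held is halved, yet nothing is lost, because decoding restores the original sequence exactly.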

There are experimental data supporting Miller’s contention that our capacity to 
process information is limited. Consider Figure 11.1. This figure contains data 
obtained by Graesser and Mandler (1978). They gave subjects sets of nouns 
ranging in number from 1 to 12. On receiving a group of nouns the subjects’ task 
was to find a dimension, feature, or attribute that linked the set members 
together. For example, if a particular set contained dog and goat, the subjects 
might say that they were both animals or that they both had a g in them. Graesser 
and Mandler measured the latencies of these responses. If you will notice, the 
number of seconds that it took subjects to find some linking dimension or feature 
increased in a linear fashion up to about 6 nouns per set. Then a distinct break 
occurred, and after that (from about 6 items to about 12 items per set) the rate of 
increase in the number of seconds required to find a response was lower than the 
increase observed up to 6 nouns per set. The authors argue that this break, or 
change in slope, is evidence for the notion of a limited processing capacity. 

They explain the decreased slope this way: With up to 6 nouns per set the 
subjects are trying to find a dimension or feature that will link all items in the set. 
But when the subjects are given more than 6 nouns per set, they cannot handle 
them all at once; the task exceeds their limited capacity. So what do they do? They 
begin to find dimensions, or features, that will link up to 6 nouns but will not 
cover the extra nouns that the experimenter has piled on. The slope decreases
after 6 nouns per set because the subjects can select from among the provided
nouns and deal with nouns that are easily related to one another; with 6 or fewer
nouns the subjects try to deal with everything the experimenter presents,
regardless of how difficult it is to respond to.

Figure 11.1 Mean response times for the production of responses relating the members of
sets to one another as a function of set size. (From Graesser, A., and Mandler, G. Limited
processing capacity constrains the storage of unrelated sets of words and retrieval from
natural categories. Journal of Experimental Psychology: Human Learning and Memory,
1978, 4, 86-100. Fig. 1, p. 88. Copyright 1978 by the American Psychological
Association. Reprinted by permission.)

The authors, in fact, report that the responses that subjects gave to sets 
containing up to 6 nouns tended to cover all 6 nouns, whereas the responses they 
gave to sets containing more than 6 nouns did not cover, or link, all of the nouns 
in these larger sets. Their conclusion is that subjects can handle, or process, only 
about 6 items at one time in this kind of task. 

RECALL BY CATEGORY 
Clustering 

If you have a pencil and paper handy, you might want to try the following 
experiment. Read the list of words contained in Table 11.3, and then immediately 
write down as many as you can recall in any order you wish. Now, having 
exhausted your ability to recall the items, look over your responses. Notice that 
you have probably grouped the items according to categories. Thus, you may 
have recalled two or more trees together or several animals together. Similarly, 
you may have recalled colors or cities together. This grouping phenomenon is 
called clustering in recall. It refers to the fact that items drawn from various
categories, and presented randomly, will be recalled together. In other words,
there is not a one-to-one relationship between order of input and order of output.
It seems that we do some organizing in there somewhere.

Table 11.3 Free-Recall List Used in Clustering Demonstration

HEMLOCK      SEATTLE      MOSCOW
CARROT       MOOSE        CROW
PARIS        BROCCOLI     REDWOOD
BLACK        WOLF         MAPLE
LETTUCE      BLUE         SQUASH
ELM          COW          TOLEDO
ZEBRA        PINE         ORANGE
PEAS         TOMATO       YELLOW
NEW YORK     BROWN        AMSTERDAM
RED          SYCAMORE     SNAKE
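
One simple way to put a number on clustering is to count how often two items from the same category appear next to each other in the recall protocol. The Python sketch below does this for a few items taken from Table 11.3. It is offered only as an illustration under stated assumptions: the category labels and the function name category_repetitions are hypothetical, and this raw count is not necessarily the index used in the studies cited in this section.

```python
# Illustrative sketch: count adjacent same-category pairs in a recall order.
# Category labels and the function name are assumptions made for this example.
CATEGORY = {
    "ELM": "tree", "PINE": "tree", "MAPLE": "tree",
    "RED": "color", "BLUE": "color",
    "COW": "animal", "WOLF": "animal",
}


def category_repetitions(recall_order, category_of):
    """Return the number of adjacent recalled pairs that share a category."""
    return sum(
        1
        for first, second in zip(recall_order, recall_order[1:])
        if category_of[first] == category_of[second]
    )


clustered = ["ELM", "PINE", "MAPLE", "RED", "BLUE", "COW", "WOLF"]
scattered = ["ELM", "RED", "COW", "PINE", "BLUE", "WOLF", "MAPLE"]

print(category_repetitions(clustered, CATEGORY))  # 4
print(category_repetitions(scattered, CATEGORY))  # 0
```

Both protocols contain the same seven items, so total recall is identical; only the order differs. A higher count means that the output order departs further from a random input order in the direction of the categories.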

The concept of clustering has a long history. Its emergence as an important 
phenomenon began with the work of Bousfield (1953; Bousfield & Sedgewick, 
1944). Since then many interesting questions have been asked. For example, does 
total recall of items from the list go up as clustering goes up? In other words, is 
there any evidence that suggests that organizing according to categories goes hand 
in hand with improved retention? The answer to this question must be a qualified 
yes. The relationship between clustering and overall recall has not been as strong 
as one might have expected. But several studies do seem to suggest a positive 
relationship between clustering and recall. You are referred to Cofer (1967), 
Forrester and King (1971), Puff (1970), Shuell (1969), Thompson, Hamlin, and 
Roenker (1972), and Weist (1972) for a full exposition of this issue. 

Another example of the kind of research done in connection with category 
clustering has to do with the difference between exhaustive and nonexhaustive 
categories. When a list involves nonexhaustive categories, the list does not contain
all of the members of the categories. For example, if our list includes 15
vegetables, we have not exhausted the category of vegetables. Exhaustive categories
usually involve very few members, all of which are included in our list. For
example, a category of seasons may be exhaustive (summer, fall, winter, spring). 
The planets of our solar system could be an exhaustive category. Psychologists 
have asked which of these two types of category produces (1) the best overall 
recall and (2) the greatest amount of clustering. Cohen (1963, 1966) presents data
suggesting that both total recall and clustering are greater for exhaustive than for 
nonexhaustive categories. It seems that we are more able to work with exhaustive 
categories, perhaps because we know when we are done with a category. Our 
organizational efforts seem to yield better retention in this instance. These examples
should provide the flavor of the research being done in connection with
category clustering (see also Buschke, 1977; D’Agostino, 1969; Hubert & Levin, 
1977; Segal, 1969). 

Subjective Organization 

Tulving (1962) is the person responsible for the concept of subjective organization
and for the associated technique for assessing organization in memory. In a
typical subjective organization (SO) experiment the subjects receive a long list of 
randomly presented items, one at a time. The items do not belong to any obvious, 
experimenter-imposed categories. The subjects are then asked to recall as many of 
the items as they can, in any order. Following recall the list is again presented, 
usually in a new random order. A series of recall and presentation phases follow 
in this fashion. Tulving and others have found that the order in which the subjects 
recall these items begins to assume some consistency from recall trial to recall 
trial. The subjects begin to recall certain words together even though they are not 
presented together. The subjects are somehow ordering, or structuring, the material.
Their output becomes consistent despite random input. Clearly, the subjects
are free to use any organizational strategy they wish. They might build a visual 
image, construct a story, relate items that sound or look alike, or use some other 
unique, individual scheme. In any case the constant order in which the subjects 
recall the items reveals that they are imposing some sort of structure on the 
materials. 
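
The idea of output consistency across trials can be illustrated with a short Python sketch that counts the adjacent pairs of items recalled in the same order on two successive trials. This is a simplified stand-in, not Tulving's (1962) actual measure of subjective organization; the word lists and the function name shared_adjacent_pairs are hypothetical, and the sketch assumes that no item is recalled twice within a trial.

```python
# Illustrative sketch: how many adjacent ordered pairs recur across two trials?
# Not Tulving's SO index; names and word lists are assumptions for the example.
def shared_adjacent_pairs(trial_a, trial_b):
    """Count adjacent ordered pairs of items that occur in both recall trials."""
    pairs_a = set(zip(trial_a, trial_a[1:]))
    pairs_b = set(zip(trial_b, trial_b[1:]))
    return len(pairs_a & pairs_b)


trial_1 = ["lamp", "river", "coin", "table", "fork"]
trial_2 = ["coin", "table", "lamp", "river", "hat"]
trial_3 = ["fork", "coin", "river", "table", "lamp"]

print(shared_adjacent_pairs(trial_1, trial_2))  # 2
print(shared_adjacent_pairs(trial_1, trial_3))  # 0
```

In the first comparison the subject keeps lamp-river and coin-table together even though the trials begin with different words. That kind of recurring pairing, in the absence of any experimenter-defined categories, is what the SO procedure is designed to detect.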

At the same time it should be noted that the SO technique does not tell us 
exactly what this developing organization is (see Mandler, Worden, & Graesser, 
1974). All we know is that something is happening. This fact underlines an 
important difference between the clustering technique and the SO procedure. In 
the clustering method, on the one hand, we do have some idea of the type of 
organizational activity that is occurring. As experimenters we define the categories
and impose them on the subjects. In SO studies, on the other hand, we can
observe the development of idiosyncratic organizational strategies unhindered by 
any experimenter-imposed constraints. 

Does increasing SO lead to greater recall? This would seem to be a critical 
question. Unfortunately, the available data are equivocal. Several studies have 
reported a positive relationship between overall recall and subjective organization 
(Allen, 1968; Mayhew, 1967; Tulving, 1962, 1964). Others have failed to find the 
expected relationship (Carterette & Coleman, 1963; Postman, 1970). There are
at least two ways to view the problem (Wood, 1972). First, these results may 
mean that SO does not facilitate retention. Second, a true and strong relationship 
between this type of organization and overall recall may so far have been 
obscured by methodological problems. 

In any case, whether or not Tulving’s original formulation of SO stands the test 
of time, it must be recognized that the concept has had considerable impact on 
our thinking and research. It is only reasonable to assume that it will be modified 
and supplemented by newer techniques. In a sense, Tulving’s SO helped open the 
floodgates and allowed the current emphasis on organization, structure, and 
elaboration to become widespread. 

ATTENTIONAL STRATEGIES 

Organizational activities take many additional forms, and information can be 
treated in many different ways. One interesting process involves not the elaboration
or transformation of information but the selection of specific, usable stimulus
information from within an array of available stimulus information. What we are 
talking about here is the fact that individuals do not always use the entire stimulus 
presented to them. They may select some portion or element of the overall 
stimulus and associate the response with that selected portion. This process is 
often termed stimulus selection, and it is particularly interesting because it
appears quite consistently in both animal and human learning situations. Stimulus
selection, or cue selection, as it is sometimes called, provides us with one more
example of the way in which animal and human capacities run parallel to each 
other. 


Humans 

Let us begin with an example involving human selection. In a paired-associate 
situation the subjects first learn a list in which the responses (for example, digits) 
are paired with different compound stimuli (for example, nonsense syllables 
surrounded by different colors). An example of a stimulus would be XGK 
surrounded by a blue rectangle. After learning the list the subjects are given each 
component of the stimuli individually (for example, a color by itself or a nonsense 
syllable by itself) and tested for recall of the responses. If one component is more 
likely to elicit the response in the recall test, we begin to suspect that some sort of 
selection process has been going on during the original learning phase. Using this 
procedure Underwood, Ham, and Ekstrand (1962) found that subjects selected 
the colors over the syllables (see Figure 11.2). This suggests that the colors are 
somehow more easily utilized than nonsense syllables in building associations. 

The investigation of stimulus selection in paired-associate learning has led to all 
sorts of concerns. For example, what determines which component of a compound
stimulus will be selected? As it turns out, there are many different variables
that affect the selection process. Postman and Greenbloom (1967) found that, if 
the stimuli are difficult-to-pronounce trigrams (for example, XZC or QKX), the 
subjects tend to select the first letter in the syllable (X or Q). Thus, ordinal 
position seems to be important in stimulus selection. Other things being equal, we 
will select the component in the left position, presumably because we read from 
left to right. Additional variables have been investigated and discussed by Houston
(1967), Merryman and Merryman (1971), Richardson (1971, 1972), and
Richardson and Chisholm (1969). In a very general sense it seems that the 
component that becomes most effective is the one that is most easily utilized as a 
cue. In other words, subjects follow the path of least resistance and utilize 
whatever they perceive to be the most convenient cue in a given situation. Since 
subjects and situations differ enormously, it is easy to see why the selection 
process may vary so much. 


Figure 11.2 Acquisition curves on ten test trials. During these trials the TC group
received both the colors and the syllables as stimuli, the C group received only the colors,
and the T group received only the syllables. The horizontal axis shows transfer trials.
(Adapted from Underwood, B. J., Ham, M., and Ekstrand, B. R. Cue selection in
paired-associate learning. Journal of Experimental Psychology, 1962, 64, 405-409.
Copyright 1962 by the American Psychological Association. Reprinted by permission.)


Animals 

Animals display stimulus selection, too (Kamin, 1969a; Wagner, 1969; Wagner,
Logan, Haberlandt, & Price, 1968). It appears that they process information in
ways roughly parallel to those of the human. For example, Kamin (1969a) 
demonstrated a form of stimulus selection in a classical conditioning situation. 
Animals experienced a series of pairings of shock (UCS) with a particular CS, 
which we shall designate as A. This procedure resulted in an emotional response’s 
becoming conditioned to the previously neutral CS (A). After these initial pairings
an additional CS (X) was added to the situation. The CS became AX. AX
was then paired with the shock (UCS) for a series of trials. The question was 
whether the emotional response would become associated with the new stimulus 
component (X). Relative to an appropriate control condition, it did not. The A 
component had already been associated with the emotional response produced by 
the shock. The introduction of an additional cue (X) was redundant. 

This effect represents an organizational effort on the part of the animal. It 
seems to correspond to the sort of effects that are obtained with humans. In both 
cases (animal and human) a response is associated with some portion of a total
stimulus array. Performance is facilitated by an efficient distribution of attention 
and effort. 


DO MENTAL IMAGES EXIST? 

Think of a giraffe . . . Take your time . . . Make it a small one. About 3 feet tall. 
Pink toenails. Now put the giraffe in a fire engine. More than likely what you 
have been experiencing (if you have been willing to engage in this sort of
nonsense) are what psychologists call visual “images.” These images are of 
considerable interest to psychologists because they are such vivid, subjective 
experiences and because they seem to be an important aspect of our mental 
activity. They have long been of interest (see Lea, 1975), but at the same time they 
have been troublesome because they are so difficult to quantify or to measure. For 
example, how would you go about comparing your image of a giraffe with 
someone else’s image of a giraffe? We cannot observe images directly. We cannot 
lay them out on the table and measure or weigh them. They are such fleeting, 
subjective experiences that we have some trouble talking about them at all. And 
yet they do seem to be important components of thought. In this section we shall 
discuss some of the current efforts to deal with these elusive, important experiences.
Specifically, we shall concern ourselves with the role that these images play
in the memory process. As we shall see, retention can be heavily affected by vivid 
mental imagery. The formation and use of images is a complex and important 
process (see Kosslyn, Reiser, Farah, & Fliegel, 1983). 

Before we turn to the nature of the impact of images on memory, we should ask 
a very basic question: Is there any experimental evidence for the existence of 
mental images above and beyond the subjective experiences we all have? In this 
section we review three different kinds of experimental evidence that have been 
taken by some to mean that visual imagery is a perceptual experience analogous 
to “pictures in the head.” It is difficult to account for these findings if one tries to 
insist that all encoding is verbal. 

Scanning Experiments 

Scanning experiments, which have been taken as evidence for the existence of 
visual images, are based on the assumption that the time it takes to scan between 
two objects will increase as the distance between those two objects is increased. 
Let us simplify the situation by describing an experiment. Kosslyn, Ball, and 
Reiser (1978) had subjects study the map contained in Figure 11.3. The subjects 
were asked to learn the locations of the objects marked with an X on the map. 
The map was then removed. Subjects were asked to form an image of the whole 
map and to focus on one of the objects named by the experimenter. When the 
experimenter named a second object, the subjects’ task was to imagine a black dot 
moving in a straight line from the first object to the second object. When the black 
dot reached the second object, the subjects stopped a timer by pushing a button. 
In agreement with the idea that the subjects were somehow scanning a mental 
visual image, the time it took to press the button increased as the distance 
between the two objects increased. It is difficult to imagine how this effect would 
have appeared if the subjects had verbally rather than visually encoded the spatial 
information. 

Figure 11.3 A fictional map used to study the effect of distances on mental scanning time.
(From Kosslyn, S. M., Ball, T. M., and Reiser, B. J. Visual images preserve metric spatial
information: Evidence from studies of image scanning. Journal of Experimental
Psychology: Human Perception and Performance, 1978, 4, 47-60. Copyright 1978 by
the American Psychological Association. Reprinted by permission.)
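
The logic of the scanning prediction can be made concrete with a small sketch. It assumes, purely for illustration, that scanning time grows linearly with the distance between two imaged objects. The object names, map coordinates, base time, and scanning rate below are hypothetical; they are not data from Kosslyn, Ball, and Reiser (1978).

```python
import math
from itertools import combinations

# Hypothetical map coordinates (in centimeters); NOT the values used in the study.
OBJECTS = {
    "hut": (2.0, 3.0),
    "tree": (5.0, 7.0),
    "well": (11.0, 3.0),
    "rock": (14.0, 15.0),
}

BASE_TIME_S = 0.5    # assumed time needed to start the scan and press the button
RATE_S_PER_CM = 0.1  # assumed scanning time per centimeter of imaged distance


def distance(a, b):
    """Straight-line distance between two named objects on the map."""
    (x1, y1), (x2, y2) = OBJECTS[a], OBJECTS[b]
    return math.hypot(x2 - x1, y2 - y1)


def predicted_scan_time(a, b):
    """Scan time under the assumed linear relation between distance and time."""
    return BASE_TIME_S + RATE_S_PER_CM * distance(a, b)


def scan_table():
    """All object pairs with distance and predicted time, shortest distance first."""
    rows = [
        (a, b, distance(a, b), predicted_scan_time(a, b))
        for a, b in combinations(OBJECTS, 2)
    ]
    return sorted(rows, key=lambda row: row[2])
```

Calling scan_table() lists the pairs from nearest to farthest, and the predicted times rise in the same order. That ordering is the pattern the scanning hypothesis expects; a purely verbal code gives no obvious reason for it.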


The interpretation of this kind of scanning effect is complicated, however, by 
results reported by Kerr (1983). He used a flat board with raised objects attached 
to it (see Figure 11.4). As in the Kosslyn, Ball, and Reiser study, subjects were asked 
to learn the positions of the objects and then asked to mentally scan from one to 
another. But the unique and crucial aspect of the Kerr study was that the subjects 
were congenitally blind. They learned the positions of the raised objects by touch. 
Then they were asked to form an image of the whole board (without using words 
like look or see) and to scan from one object to another. The author’s argument 
here is that, if the scanning effect is essentially a visionlike experience, it should 
not appear with congenitally blind subjects. But it did; the time it took blind 
subjects to scan from one object to another increased as the distance between 
those objects increased. Kerr suggests that images of spatial relationships are 
therefore not necessarily like “pictures in the head.” His results do not prove that 
visual images do not exist. They do suggest, however, that other, perhaps nonvisual,
images may also be utilizable by the human.

Mental Rotation 

A second kind of support for the existence of mental images comes from mental
rotation studies. Consider the objects in Figure 11.5. Your task is to decide if the
two members of each pair are the same or different. Shepard and Metzler (1971)
asked subjects to judge 1,600 such pairs of objects. The subjects pulled one lever
with their right hand if they believed the two objects in a given pair were the same
and a different lever with their left hand if they concluded that the two objects
were not the same. The experimenters carefully measured how long it took the
subjects to make these decisions.

Figure 11.4 The board and figures used in a scanning experiment with the congenitally
blind. (Adapted from Kerr, N. H. The role of vision in visual imagery experiments:
Evidence from the congenitally blind. Journal of Experimental Psychology, 1983,
265-277. Fig. 1, p. 267. Copyright 1983 by the American Psychological Association.
Reprinted by permission.)

Figure 11.5 Examples of pairs of patterns differing in orientation. (From Shepard, R. N.,
and Metzler, J. Mental rotation of three dimensional objects. Science, 1971, 171,
701-703. Copyright 1971 by the American Association for the Advancement of Science.
Reprinted by permission.)

The crucial aspect of the experiment was that members of each pair differed in 
orientation from 0 degrees to 180 degrees. It was found that the more the two 
members of a given pair differed in terms of orientation, the longer it took the 
subjects to make a decision about them. This finding is consistent with the idea 
that subjects imagined one object rotating until it either matched or failed to 
match the other object. The farther they had to rotate the object, the longer the 
decision took. When the subjects were asked how they had performed this task, 
they indicated that they had rotated a visual image of one of the objects until it 
had the same orientation as the other. 

Although the exact significance of these findings is not entirely clear (see 
Carpenter & Eisenberg, 1978), many feel that they demonstrate our ability to, at
least part of the time, store and use visual mental imagery. 


Imagery and Interference 

There is one more kind of effect that supports the idea that we really do 
experience and use visual mental imagery. Matlin (1983) suggests that you try to 
imagine a good friend’s face with your eyes closed. That is not too difficult, is it? 
But now try to imagine the same face while your eyes wander over the words on 
this page. You will probably have trouble “looking” at the face and the words at 
the same time. 

This kind of interference effect has been brought into the laboratory by Brooks 
(1968). Consider the material in Figure 11.6. Subjects first saw a block letter such 
as the C in the figure. The letter was then removed, and the subjects had to decide 
from memory whether each corner of the block letter was either at the very top 
or the very bottom of that letter. For example, beginning with the asterisk in
Figure 11.6 and moving clockwise the correct answers are yes, yes, no, no, no, no,
yes, yes. 

Among other things, Brooks had subjects answer in two different ways. Some 
merely said yes and no verbally. Others had to respond by pointing to Ys and Ns 
in a complex spatial array of letters such as that on the right side of Figure 11.6. 
Brooks felt that pointing would involve a lot of interfering visual processing 
whereas vocalizing would not. In other words, trying to keep the visual image of a 
C in mind while also looking for Ys and Ns would be difficult. Answering with a 
verbal yes or no should not be interfering. Hence, Brooks predicted that performance
in the pointing condition would be poorer than in the speaking condition,
and that is what he found. A visual mode of responding interfered with a visual 
task, whereas a verbal mode of responding did not. 

Brooks also turned the situation around and had additional subjects perform a 
verbal task (such as making judgments about words in remembered sentences). In 
this case verbal responding produced interference, whereas pointing did not. 

In summary, Brooks’s work suggests that we do, at least sometimes, store and 
use visual images and not just verbal material. Thus, there are at least three kinds 
of evidence (scanning, rotation, interference) that suggest we take very seriously
the idea of visionlike “pictures in the head.” These images need not be whole or 
complete representations of what we see, but they do seem to contain a good 
bit of detail, and we seem to be able to form, store, and use them when the need 
arises. 


WHAT ARE IMAGES? TWO HYPOTHESES 

What exactly are these elusive images that we all experience? Currently, there are 
two major ways of thinking about images. First, there is the perceptual-experience
interpretation. It proposes that, although we may not actually have
“pictures in our head,” we do experience something similar to what happens 
when we look at a real object. In other words, an image is a perceptual experience 
that has something in common with the experience of actually seeing something. 
The problem with this formulation is its vagueness. No one seems to be able to 
pin down the nature of this proposed perceptual experience. 

The second hypothesis, called the propositional interpretation, argues that the
best way to think about images is in terms of abstract propositions (Anderson, 
1978, 1979; Anderson & Paulson, 1978; Hampson & Morris, 1978; Pylyshyn, 
1973). According to this view, images are best thought of in terms of rules and 
words rather than pictures. For example, consider the stimulus “a circle is over a 
square.” According to the perceptual-experience hypothesis, we might store this 
as a small drawing of a circle placed directly over a square; that is, we might
construct some form of visual image of the relationship.
The propositional interpretation, in contrast, would argue that we store something
like (above, circle, square), which, in the language of logic, is the way that
we designate the proposition that a circle is above a square. 
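
The contrast between the two codes can be sketched in a few lines. This is only an illustration under stated assumptions: the Proposition type, the grid layout, and both lookup functions are hypothetical devices, not a model proposed by any of the authors cited here.

```python
from typing import NamedTuple


class Proposition(NamedTuple):
    """An abstract relation between two things, e.g. (above, circle, square)."""
    relation: str
    first: str
    second: str


# Propositional code: the relationship is stored as a rule-like statement.
STORED_PROPOSITION = Proposition("above", "circle", "square")

# Picture-like code (hypothetical): a tiny grid in which row 0 is the top.
STORED_PICTURE = [
    ["circle"],
    ["square"],
]


def above_in_proposition(prop, upper, lower):
    """Answer 'is upper above lower?' by matching the stored statement."""
    return prop == Proposition("above", upper, lower)


def above_in_picture(picture, upper, lower):
    """Answer the same question by inspecting positions in the grid."""
    row_of = {}
    for row_index, row in enumerate(picture):
        for shape in row:
            row_of[shape] = row_index
    return upper in row_of and lower in row_of and row_of[upper] < row_of[lower]
```

Both stores give the same answer to the question "Is the circle above the square?" This makes it easy to see why the two interpretations are hard to tell apart on the basis of behavior alone.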

Many investigators, including Anderson (1978), argue that it is impossible to 
distinguish between the perceptual-experience hypothesis and the propositional 
hypothesis. These investigators feel that, given the difficulty in trying to articulate 
the perceptual-experience position, the propositional interpretation is to be
preferred. 

As we have seen, however, there are also a number of investigators who argue 
that an image is definitely more than an abstract proposition (see Hayes-Roth, 
1979; Keenan & Moore, 1979). Neisser (1978), for example, points to the
following sort of demonstration as support for the perceptual-experience hypothesis.
If asked to describe a hawk’s beak, people will report evoking an image of an
entire hawk and then “inspecting” that image, or searching for the beak. The 
argument is that there really is something like a “picture in the head” that can be 
evoked in its entirety and then scanned for detail. At present there is no clear way 
to decide which of these two hypotheses concerning the nature of images is better. 

DUAL-TRACE THEORY: VERBAL AND VISUAL CODING

All this talk about visual imagery should not cause us to lose sight of one
important fact. We do not always encode information in terms of visual images, 
whatever they are. We do not have to use visual imagery. We can, after all, encode 
information verbally as well as visually. If you are trying to remember the word 
horse, you do not have to cook up an image of a horse. You can obviously process
this item as a word possessing auditory and semantic attributes rather than visual
attributes. Alternatively, if you wish to, you can probably encode horse both
visually and verbally (see Schwanenflugel and Shoben, 1983). 

Paivio (1971, 1975, 1978) has proposed what has become known as the 
dual-trace hypothesis. According to this hypothesis, any given stimulus (verbal or 
visual) can be encoded verbally or visually (or both). We have the capacity to 
encode a given bit of information either visually or verbally. Exactly which type of 
coding we use depends on the nature of the information to be coded and the 
demands of the task. Paivio distinguishes between concrete items (horse, boat, 
apple) and abstract items (justice, even, obtuse). He argues that both concrete and 
abstract items are easily coded verbally. But concrete items are much more likely 
to be coded visually than are abstract items. It is easier to come up with a visual 
image for horse than for obtuse. Concrete items are also much more likely to be 
encoded both verbally and visually. An abstract item is much more likely to be 
coded only in a verbal sense. The reader interested in pursuing this line of 
reasoning is referred to the large body of research (Berrian, Metzler, Kroll, &
Clark-Meyers, 1979; Harris, Morris, & Bassett, 1977; Kulhavy & Heinen, 1974;
Lutz & Scheirer, 1974; Mondani & Battig, 1973; Nilsson, 1975; Paivio, 1971;
Paivio & Foth, 1970; Pellegrino, Rosinsky, Cheisi, & Siegel, 1977; Pellegrino,
Siegel, & Dhawan, 1975; Wortman & Sparling, 1974).

Although Paivio’s hypothesis has received considerable attention, it would be 
premature to conclude that the relationship between verbal and visual encoding 
has been clearly delineated. 

One final note concerning verbal and visual encoding: On the one hand, several 
authors (for example, Hilgard & Bower, 1975) have suggested that verbal encoding
is particularly useful for information that is of a sequential nature, or distributed
in time. On the other hand, the visual mode is often seen as a specialized
means of encoding information that is presented simultaneously and is distributed
in space. Whether this intriguing possibility will prove to be valid remains to be
seen. It is already clear that the distinction is not ironclad. For example, sequential
information can be stored visually. We can encode sequentially presented
items by placing them in ordered positions within an overall visual image (for 
example, put the early items on the left and later items on the right, or put first 
items on the first floor of a tall building and later items on higher floors). 
Similarly, it seems reasonable to assume that we can encode spatially distributed 
items in a verbal manner. For example, we might encode the information on a 
map by storing “San Francisco is above Los Angeles.” The point here is that 
the mind is flexible enough to utilize both means of coding for both types of 
information. 

In fact, it seems reasonable to assume that many of our natural encoding efforts
involve both verbal and visual processes. We may seldom use pure visual or pure 
verbal codes. For example, Underwood and Schulz (1960) had subjects learn lists 
of nonsense syllables paired with words. After learning the lists the subjects were 
asked how they had managed the task. Many of them reported some sort of 
activity involving images. Some of their results are contained in Table 11.4. From 
the subjects’ responses it seems clear that their efforts could easily have involved 
both verbal and visual components. Assuming that we can trust subject reports, 
we would be forced to conclude that the verbal and visual modes of encoding 
often go hand in hand. 

People differ in how much elaborating they do, whether it be verbal, visual, or 
combined elaboration. For example, Wang (1983) compared the amounts of 
elaborating that slow learners did with the amounts displayed by fast learners. 
The fast learners showed a marked superiority in how much elaborating they did. 
Even more intriguing is the fact that, when Wang gave the elaborations developed 
by slow learners and those developed by fast learners to new subjects, the new 
subjects receiving the fast learners’ elaborations learned the material more easily 
than did the new subjects receiving the slow learners’ elaborations. In other 
words, people differ in terms of the quality as well as the quantity of the 
elaborations they develop. 

MEMORY FOR PICTURES 

Generally speaking, although there are exceptions (see Postman, 1978), memory 
for pictures is better than memory for the verbal names of those pictures (see 
Intraub, 1979; Loftus & Kallman, 1979). For example, if you are shown a series 
of words including dog, cloud, branch, pencil, and car and asked to remember 
them, you will find it a little harder to do than if you had been shown a series of 
pictures including a dog, a cloud, a branch, a pencil, and a car. 

This difference has stimulated an enormous amount of research and thought 
(see Kosslyn & Schwartz, 1977; Nelson & McEvoy, 1979; Nelson, Reed, & 
McEvoy, 1977; Weaver & Stanny, 1978). The basic question underlying much of 
this work is this: Do we process and store picture stimuli in a manner that is 
different from the way in which we process and store verbal stimuli?

As you might expect, some say yes and some say no. The dual-code hypothesis
discussed previously (Paivio, 1978) argues that pictures are better remembered
than words because they are more likely than words to be coded both verbally
and in terms of some kind of visual image. Thus, this position argues that we
do process pictures differently from words; we are more likely to encode them in
two ways.

Table 11.4 Examples of Visual and Verbal Mediators Used by Subjects in Acquiring
Trigram-Word Pairs

STIMULUS   RESPONSE   MEDIATING EVENT

CYF        KID        C and K similar in sound.
XBN        GAT        Gat is odd word. X is odd letter.
DSU        CAT        D to Dog to CAT.
CFY        DOG        C to Cat to DOG.
XBN        RAT        X suggested poison, and poison, RAT.
TPM        AND        T associated with symbol (&) for AND.
RZL        SAT        R to Rump to SAT.
RZL        SAT        R and S in alphabetical order.
DSU        BAN        U-BAN (brand of coffee).
RZL        KID        RZL suggested Russell; Russell is a KID.
CFY        THE        Remembered because first one in list.
KHQ        FAN        KHQ to radio to radio FAN.
IGW        MAN        W inverted looks like M, hence MAN.
IGW        MAN        IG to IGnorant to IGnorant Man.
RZL        BOY        RZL looks like lazy: hence, lazy BOY.
RZL        CAT        Z is hissing sound of CAT.

Adapted from Underwood, B. J., and Schulz, R. W. Meaningfulness and verbal learning. Philadelphia: Lippincott, 1960, p. 299.

Other investigators are not convinced that pictures are really handled in any 
way that is dramatically different from the way we process words (see Durso &
Johnson, 1979; Hogaboam & Pellegrino, 1978; Loftus & Kallman, 1979; Loftus,
Miller, & Burns, 1978; Nelson, Reed, & McEvoy, 1977; Potter & Faulconer,
1975).


INTERACTING IMAGES 

Given this large and growing body of research concerning images and mental 
processing, can it be applied practically? There do seem to be some areas relevant 
to the problem of improving retention. For example, a number of investigators 
have shown that paired associates are retained better if the stimuli and responses 
of each pair are related to each other through the use of some visual image 
(Corbett, 1977). These images can be developed by the subject (Bower, 1972; 
Dempster & Rohwer, 1974; Paivio, 1971), or they can be given to the subject by
the experimenter (Wollen & Lowry, 1974).

What kinds of image are best? There is some evidence to suggest that interactive
images are better than noninteractive images. For example, if you are attempting
to remember the pair horse-orange, it is probably better to imagine a horse
eating an orange than it is to imagine a horse standing in one place and an orange
lying in another spot (Bower, 1970; Robbins, Bray, Irvin, & Wise, 1974). It seems
that interactive imagery is better than separation imagery, although both of these 
seem to be better than no imagery at all. 

However, some researchers (for example, Begg, 1978) have found that interactive
imagery is not always superior to separate imagery. Furthermore, Nelson,
Greene, Ronk, Hatchett, and Igl (1978) found that any interactive imagery was
better than no interactive imagery but that increasing the number of interactive
images did not further increase recall. They had one group of subjects make up a
noninteractive image such as the one in Figure 11.7a. A second group developed
one single interactive image such as that in Figure 11.7b. Two additional groups
developed multiple interactive images such as those in Figure 11.7c and 11.7d. It
was found that all three interactive-imagery groups did better in a retention test
than the noninteractive-imagery group but that they did not differ from one
another. In other words, there appears to be a limit to how useful interactive
imagery can be to us. One good interactive image seems to help us just as much
as various kinds of multiple interactive image (see also Day & Bellezza, 1983).

Figure 11.7 Example of the kind of image that could be formed for the word pair
boot-chair in each of the four conditions: (a) noninteractive image, (b) single interactive
image, (c) homogeneous multiple interactive image, (d) heterogeneous multiple
interactive image. (Adapted from Nelson, T. O., Greene, G., Ronk, B., Hatchett, G., and
Igl, V. Effect of multiple images on associative learning. Memory and Cognition, 1978, 6,
337-341. Fig. 1, p. 338.)

It is often asserted that making up strange, unusual, bizarre interactions will 
lead to better retention than will devising ordinary interactions. Although there is 
some evidence for this idea (see Lesgold & Goldman, 1973; Webber & Marshall, 
1978), the available data do not provide completely convincing support. For 
example, Wollen, Weber, and Lowry (1972) showed subjects pictures like those in
Figure 11.8. The authors found that interaction was important but that bizarreness
was not. That is, interacting pairs were recalled better than noninteracting
pairs regardless of their degree of bizarreness. Bizarreness had no effect on recall, 
so the conclusion we must draw is that bizarreness may sometimes be helpful but 
that it is not always a critical variable. 

MNEMONICS 

Mnemonics are organizational or elaborative techniques that we can use to 
improve retention. Generally, mnemonics help us remember a new set of materials
by relating it to some old, well-learned set of materials.

Figure 11.8 Sample materials for the four conditions in an experiment on the bizarreness
of imagery. The four panels are: noninteracting, nonbizarre; noninteracting, bizarre;
interacting, nonbizarre; interacting, bizarre. (Adapted from Wollen, K. A., Weber, A.,
and Lowry, D. H. Bizarreness versus interaction of mental images as determinants of
learning. Cognitive Psychology, 1972, 2, 518-523.)


Rhyming 

Many mnemonics involve visual images. Let us consider an example of the way in 
which a mnemonic technique involving visual imagery can facilitate retention. We 
begin with a little game, or mnemonic, for remembering lists of unrelated words 
(Miller, Galanter, & Pribram, 1960). First, we must refresh ourselves with respect
to an old children’s rhyme. Rehearse the rhyme contained in Table 11.5 a couple 
of times. Once you have this rhyme firmly in mind, you are ready to move to the 
next step, which is to learn a list of 10 unrelated words. The first word in the list is 
to be associated with “one is a bun.” The second word is to be associated with 
“two is a shoe,” and so on through the list. The first word is to be associated with 
“one is a bun” through the creation of some vivid visual image containing both a 
bun and the first word. For example, if the first word were house, you might 
imagine a house made of steaming hot-cross buns or a row of tiny green and red 
houses tucked neatly into a sliced bun. Each of the successive words in the list is to 
be involved with the appropriate rhyme element through the creation of some 
vivid image. Try this plan with the list of words contained in Table 11.6. Run 
through the list, one at a time. Give yourself enough time to establish a firm image 
for each word. 

Table 11.5 Rhyme Used in Mnemonic Memory Plan

One is a bun.
Two is a shoe.
Three is a tree.
Four is a door.
Five is a hive.
Six is a stick.
Seven is heaven.
Eight is a gate.
Nine is wine.
Ten is a hen.

Table 11.6 List of Words to Be Used in Mnemonic Memory Plan

CAR
FLAG
CLOCK
BOOK
KNIFE
SHIRT
DRUM
SCISSORS
PEN
SKI

Assuming that you have completed the task, we shall test the effectiveness of
the mnemonic by asking a series of questions. Cover the list of words. What is the
seventh word? What goes with door? What goes with wine? What is the third
word? What is the fifth word? In a subjective way you can probably sense what is
occurring. The test words are so bound into your visual images that it is a simple
matter to recall them. This plan for remembering has been tested experimentally.
Bugelski (1968) had an experimental group learn lists of words using this plan
and had control subjects learn the same lists without the plan. The experimental
subjects recalled significantly more items than did the controls. In all likelihood, if
you had not first learned the rhyme and been instructed in how to create images,
your recall of the list would have been reduced.
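
The structure of this rhyming plan can be sketched as a simple pairing of pegs with list words. The sketch below takes the pegs from the rhyme in Table 11.5 and the words from Table 11.6; the function names are hypothetical, and a stored pair merely stands in for the vivid image a person would actually form.

```python
# Pegs from the rhyme in Table 11.5, in order from "one" to "ten".
PEGS = ["bun", "shoe", "tree", "door", "hive",
        "stick", "heaven", "gate", "wine", "hen"]

# Words to be remembered, from Table 11.6, in list order.
WORDS = ["CAR", "FLAG", "CLOCK", "BOOK", "KNIFE",
         "SHIRT", "DRUM", "SCISSORS", "PEN", "SKI"]


def build_plan(pegs, words):
    """Pair each word with the peg for its list position (one 'image' per pair)."""
    if len(words) > len(pegs):
        raise ValueError("the rhyme supplies only as many pegs as it has lines")
    return list(zip(pegs, words))


def word_at(plan, position):
    """Recall by number: position 1 is the first word in the list."""
    return plan[position - 1][1]


def word_for_peg(plan, peg):
    """Recall by peg: find the word that shares an image with this peg."""
    for stored_peg, word in plan:
        if stored_peg == peg:
            return word
    raise KeyError(peg)


plan = build_plan(PEGS, WORDS)
```

With this pairing, word_at(plan, 7) and word_for_peg(plan, "door") answer the test questions posed above. The well-learned rhyme supplies the retrieval cues, and each image links one cue to one new word.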

Inventing Stories 

Another type of mnemonic technique involves making up stories using the to-be- 
remembered words (Bower & Clark, 1969). Consider the word lists in Table 11.7.
One of the easiest ways to learn these lists is to invent a story containing all of the 
words in the target list. The stories on the right side of Table 11.7 enable us to 
remember not only the target words but the order of the words as well. The 
stories containing the words are easier to remember than the words alone. 

Table 11.7 Word Lists to Be Memorized and Used to Construct Stories

WORD LISTS    STORIES

Bird          A man dressed in a Bird
Costume       Costume and wearing a Mailbox
Mailbox       on his Head was seen leaping into
Head          the River. A Nurse ran out
River         of a nearby Theater and applied
Nurse         Wax to his Eyelids, but her
Theater       efforts were in vain. He died
Wax           and was tossed into the Furnace.
Eyelid
Furnace

Rustler       A Rustler lived in a Penthouse on
Penthouse     top of a Mountain. His specialty
Mountain      was the three-toed Sloth.
Sloth         He would take his captive animals
Tavern        to a Tavern, where he would
Fuzz          remove Fuzz from their Glands.
Gland         Unfortunately, all this exposure
Antler        to sloth fuzz caused him to
Pencil        grow Antlers. So he gave up his
Vitamin       profession and went to work in
              a Pencil factory. As a precaution
              he also took a lot of Vitamin E.

Adapted from Houston, J. P., Bee, H., Hatfield, E., and Rimm, D. C. Invitation to psychology (2nd ed.). New York: Academic Press,
1983. Table 3, p. 303.

The Method of Loci

There are many additional mnemonic techniques. One of them is called the
method of loci and is described below by Houston, Bee, and Rimm (1983, pp.
302-303).

The method of loci works by placing mental images of what we are trying to
remember in ordered mental locations. This method was invented by the
ancient Greeks, who used it to help them remember topics and issues during
long public speeches. The first step in using the method of loci is to establish
an ordered series of known locations. For example, you might take a “mental
walk” through a very familiar building such as your home. A walk outside
will do just as well, as long as it involves a series of known locations. At
distinct points along your mental walk, you “place” vivid images of what you
are trying to remember: the first item in the first location, the second in the
second, and so on.

For example, suppose you are trying to remember a list of items to be 
purchased at a hardware store. The items include nails, a hammer, a pair
of pliers, a chisel, and a measuring tape. Now suppose you have chosen a 
“mental walk” through your house as a means of supplying convenient, 
well-known “locations,” or “hooks,” on which to place the images of the 
objects to be remembered. It has been suggested that this system works best if 
the images are vivid. Thus, if your walk begins at the front door, you might 
visualize the front door nailed shut with hundreds of green and maroon nails. 
Approaching the stairway, you might imagine a huge hammer with legs 
sliding down the bannister. In the kitchen you might have an enormous pair 
of pink pliers pinching the refrigerator in half. In the dining room you could 
visualize hundreds of little blue chisels buzzing about the food like flies. In the 
back yard, you might imagine a snakelike measuring tape devouring apples. 

Try this method for yourself using the words below. First, establish your
mental walk. Then fill the locations with vivid images of the items. Then test
the system by “taking a walk” through your locations, and see how simple it
is to recall not only the items but their correct order as well.

Forest
Mayor
Pillar
Rooster
Tablespoon
Gymnast
Infant
Shack
Textbook
Warplane
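
The hardware-store example in the passage above can be sketched as an ordered set of locations, each holding one item. The locations and items come from that example; the function names are hypothetical, and a dictionary entry only stands in for the vivid image that does the real work.

```python
# The mental walk from the hardware-store example, in the order it is walked.
LOCATIONS = ["front door", "stairway", "kitchen", "dining room", "back yard"]

# The shopping list, in the order it is to be remembered.
ITEMS = ["nails", "hammer", "pliers", "chisel", "measuring tape"]


def place_items(locations, items):
    """'Place' each item at the next location along the walk."""
    if len(items) > len(locations):
        raise ValueError("the walk needs at least one location per item")
    return dict(zip(locations, items))


def take_walk(placements, locations):
    """Retrace the walk and collect the items in their original order."""
    return [placements[place] for place in locations if place in placements]


placements = place_items(LOCATIONS, ITEMS)
recalled = take_walk(placements, LOCATIONS)
```

Because the walk itself is already well learned and ordered, retracing it returns the items in their original sequence, which is why the method preserves order as well as content.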


The Keyword Method 

As a final example of mnemonic methods consider the technique described here 
by Houston, Bee, and Rimm (1983). 

The keyword method is a technique for remembering the English meaning of 
foreign words (Atkinson, 1975; Pressley & Levin, 1981). Suppose you want to 
remember that the Spanish word “carta” means “letter.” The first step is to
think of an English word that sounds like “carta,” such as “cart.” This is
the keyword. Next, form a visual image of the keyword and the English
translation, such as a mail carrier pulling a kiddy-cart full of letters. Later,
when you are given “carta” and asked for the English meaning, “carta” will
remind you of “cart,” which will, in turn, key off the visual image containing
the “letter” you are after. (pp. 304-305)

ORGANIZATION AND ASSOCIATIONS

This chapter about organization, elaboration, and imagery closes with a question
that contrasts earlier stimulus-response conceptions of memory with more recent
cognitive approaches. Does a more cognitive approach that emphasizes organization,
elaboration, and imagery preclude a consideration of S-R associations? In
one sense an organizational approach does seem to be concerned with elements
that are not easily described in terms of S-R associations (images, structures, and
the like). The organizational approach seems to be concerned with “cognitive” 
events that might better be described in terms of some new, nonassociative 
language. And yet, as Postman (1972) suggests, there is no necessary disagreement 
between associative and organizational conceptions of memory (see also Voss, 
1972). The two approaches clearly refer to two different ways of thinking and 
speaking about the same events. As Postman puts it, the differences between the 
two approaches appear to be a matter of language more than anything else. In a 
sense it seems that the organizational approach focuses on the complexity and the 
structure of the processes and events that intervene between input and output. 
The organizational psychologist speaks of these intervening mental events using a 
certain kind of language (for example, one that emphasizes such components as 
chunks, networks, and higher-order units). But these same mental events could 
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probably be translated into S—R language. It would merely be a matter of trying 

to identify extremely complex, interacting S—R relationships. 

SUMMARY 

1. Organization, structure, elaboration, and imagery are important concepts in 
the modern study of memory. 

2. Organizational activities refer to what we do to information between input 
and output. Although there is little agreement concerning an exact definition 
of organization, most conceptions refer to the fact that we often systematize, 
relate, structure, order, select, elaborate, and transform information. 

3. An interest in organization is not new. The issues of organization were raised
by Gestalt psychologists many years ago. New methods of assessing
organization have renewed interest in these issues.

4. In a word-association test the subjects’ task is to give the first word that 
comes to mind in response to a set of stimulus words. Some stimulus words 
tend to elicit common responses (for example, black-white). The patterns of
obtained word associations suggest that words are somehow structured, or 
organized, within us. 

5. As children grow older, their word associations become more semantic and 
less episodic. 

6. Word-association tests have been used in clinical settings in attempts to 
reveal the patients’ emotional and cognitive states. 

7. The capacity of immediate memory is probably limited by the number of 
chunks of information it can hold (7 ± 2). 

8. A chunk may vary in terms of the amount of information that it contains. 

9. In general, as long as information can be expressed as a single response, it 
represents a single chunk. 

10. We can increase the capacity of our immediate memory by increasing the 
amount of information contained in our chunks of information. 

11. Response times for the production of responses relating the members of sets 
to one another show a definite break at about six items. 

12. Category clustering in recall refers to the fact that items drawn from various 
categories and presented randomly will often be recalled together. 

13. Although the issue is not completely resolved, there is some evidence to 
suggest that overall recall increases as clustering increases. 

14. Total recall and clustering both seem to be greater for exhaustive than for 
nonexhaustive categories. 

15. In subjective organization experiments, lists of items are randomly presented 
and recalled over and over. It is found that the order of recall takes on 
consistency despite random input. 

16. Subjective organization experiments do not provide us with much
information concerning the nature of the evolving organization.

17. The relationship between subjective organization and overall recall has not 
been clearly determined. 

18. Stimulus selection refers to the fact that the organism does not always use the 
entire stimulus presented to it. Sometimes we select some portion of the 
presented stimulus and associate the response with that portion. 

19. Many factors, including ordinal position, have been shown to influence
stimulus selection. 

20. Animals as well as humans display stimulus selection. This represents 
another similarity between animal and human learning processes. 

21. Mental images are the subject of much current investigation. Imagery can be 
shown to improve retention. 

22. The results of many scanning experiments support the idea that we do 
encode, store, and use visionlike mental images. 

23. However, the fact that congenitally blind subjects can also display typical
scanning-experiment results requires that we be cautious in our
interpretation of scanning results.

24. Mental-rotation experiments, also taken as support for visual imagery, show
that the speed of matching two objects decreases as the difference in
orientation of the two objects increases.

25. Interference effects in imagery also support the idea of a
“pictures-in-the-head” interpretation of imagery.

26. Imagery has been interpreted according to a perceptual-experience
hypothesis and a propositional interpretation.

27. According to the dual-trace hypothesis, both concrete and abstract items are 
likely to be coded verbally. But concrete items are more likely to be coded 
visually than are abstract items. 

28. Although the distinction does not always hold true, some investigators have 
suggested that spatially distributed information is often encoded visually, 
whereas temporally distributed information is often encoded verbally. 

29. It seems likely that our natural encoding efforts involve both verbal
and visual components.

30. It has been found that fast learners elaborate more than slow learners. In
addition, the elaborations developed by fast learners help new learners more
than do the elaborations of slow learners.

31. Memory for pictures is usually better than memory for the names of those 
pictures. Some feel that pictures are processed and stored in a manner that is 
fundamentally different from the way in which we store verbal stimuli; 
others do not. 

32. Paired associates are retained better when the stimuli and responses are 
related to each other through the creation of some visual image. 

33. Interactive images appear to be more effective than noninteractive images. 

34. Bizarreness of image may sometimes be helpful, but not always. 

35. There is a limit to how helpful interactive imagery can be. 

36. Mnemonics are techniques we use to improve retention. Rhyming and the 
composition of stories using to-be-remembered items can be helpful. 

37. Other mnemonic techniques include the method of loci and the keyword 
method. 

38. The organizational approach does not necessarily preclude a consideration of 
S-R associations.
PART FIVE

COGNITIVE PROCESSES


Recent years have witnessed a sharp resurgence of interest in cognitive
processes. Essentially, cognitive psychologists are interested in studying and
understanding complex mental events. They are not afraid to talk about ideas,
images, thoughts, symbols, knowledge, and reasoning. In fact, it is just these
elusive, complex mental events that intrigue cognitive psychologists. They do
not want to be limited to a consideration of strictly observable events; they
want to “look inside the head” and understand our mental tools and
manipulations. Whereas Hull, some decades ago, argued that to maintain rigor we
must avoid a consideration of “mentalistic” events, modern cognitive
psychologists argue that we can, after all, understand complex mental events
even though they cannot be observed directly. They feel that, although Hull and
the behaviorists may have been right in their time, our new experimental
techniques and our ever-increasing store of analytical tools now enable us to
return to many interesting questions about thought and thinking that have been
neglected for some time.

Because cognitive psychology is new, or at least in the ascendance, it should
not be surprising to learn that there is a good deal of disagreement over what
is meant by cognition. Houston, Bee, and Rimm (1983) point out that there are at
least five popular ways of looking at cognition:

1. Cognition as information processing. Many psychologists think of cognition
as the overall processing of information. According to Neisser (1967),
cognition is an inclusive term that refers to all the processes by which sensory
input is transformed, reduced, elaborated, stored, retrieved, and used. Although
very broad and general, this definition does have value, because it underlines
the fact that many cognitive psychologists think about thinking in terms of
information processing. They use the language of computers. In fact, some have
attempted to construct computer programs that will simulate human thinking.
These efforts usually fall under the heading of artificial intelligence.

2. Cognition as manipulation of mental symbols. Some psychologists prefer to
think of cognitive events as the manipulation of mental symbols. As we have
seen, a symbol is anything that stands for, or represents, something else.
Therefore, when I ask you to think of a dog and you do so, whatever it is that
you are thinking of is a symbol for dog. Symbols free us from being trapped in
the present. They allow us to make excursions into the past and future. For
example, when you think of the Boston Tea Party, you are visiting the past
through the use of symbols. When you think about the next presidential election,
you are moving into the future in a way that would not be possible without
symbols.

3. Cognition as problem solving. As you will recall, we mentioned problem
solving back in Chapter 1. A problem exists when a motivated organism is blocked
from attaining a goal by an obstacle or obstacles. Some cognitive psychologists
like to think of problem solving as the essence of cognition. They feel that
cognition refers to the ways in which we gather and use information in the
pursuit of the solution to problems. Obviously, some cognitive events, such as
finding the answer to a mathematical question, involve problem solving. But the
real question is whether all cognitive events are of the problem-solving
variety. For example, suppose that the name or face of an old friend “pops into
your mind.” Was this a case of problem solving? It may well be some subtle form
of problem solving, such as satisfying a fantasy wish or avoiding boredom. But
it seems more likely that problem solving may be one form of cognitive event,
rather than the other way around. Still, there are some who hold to the position
that all cognition is problem solving.

4. Cognition as thinking. Most cognitive psychologists would agree that their
primary interest is in the area of thinking. A firm definition of thinking is
currently unavailable in the field of psychology. Stating that cognition is
thinking does not get us very far. For heuristic purposes, however, drawing a
parallel between thinking and cognition can be helpful. The term thinking
certainly covers an enormous range of very complex events, and most of them seem
to be described equally well as cognitive events.

5. Cognition as knowing, remembering, perceiving, thinking, judging, problem
solving, reasoning, learning, imagining, conceptualizing, and using language.
According to this approach, the best way to think about cognition is to list
every kind of event that might possibly be considered cognition and let the
matter go at that. No attempt is made to be more precise. At this early stage in
the investigation of cognition, this approach has merit.

These five approaches are not isolated and distinct ways of viewing cognition.
Rather, each seems to emphasize some salient quality of the overall concept of
cognition. When thinking about cognition, it is a good idea to keep them all in
mind, for each has something to offer.

It should also be clear by now that we have already begun our study of
cognition in many of the preceding chapters. We have already looked at topics
such as memory, organization, information processing, and so on, all of which
clearly represent issues that are of interest to the cognitive psychologist. We
have been considering cognition all along. You cannot really think about
learning and memory without thinking about cognition.

What we will do in the next two chapters is consider some topics that are often
mentioned as being of special concern to the cognitive psychologist. Thus, in
Chapter 12 we shall discuss concept formation and problem solving. In Chapter 13
we shall address the wonderful area of language usage. Remember that these are
selected topics; they do not exhaust the concerns of the cognitive psychologist.
But they are representative of the cognitive approach.

WHAT IS CONCEPT FORMATION?

Defining Concepts 

In Chapter 1 we noted that a concept is a symbol or group of symbols that stands 
for a class, or group, of objects or events that possess common properties. Thus, 
house is a concept because it is a symbol that stands for a large group of objects, 
all of which possess common characteristics. Most words, with the exception of
proper nouns, which refer to only a single object, are concepts. Concepts need not
involve language. For example, babies can have a concept of mother long before 
they have acquired language. Concepts help us to think efficiently, because they 
free us from having to give a unique label to each new instance of a concept. For 
example, if we see a new car, we do not have to give it a unique label. We just 
label it as one more car. Concepts are so ingrained in our thought processes that it 
is difficult to imagine what life would be like without them. 

Concepts are related to one another in complicated ways. For example, a 
concept can be an instance of another, broader concept. Thus, bee is a concept, 
but it is also an instance of the concept insect, which, in turn, is an instance of the
concept animal. 


Defining Concept Formation 

Concept formation, which is of particular concern to learning psychologists, 
refers to how we go about learning or acquiring concepts. Specifically,
psychologists are interested in understanding how we learn to identify objects or events as
examples of particular concepts. When we see a baby that we have never seen 
before, we readily identify it as an example of the concept baby. When, where, 
and how did we learn this complex ability to correctly categorize so many objects 
and events? These questions intrigue the cognitive psychologist and lead to the 
experiments that we will consider in this chapter. 

Concepts and Problems 

In a sense concept formation can be thought of as an example of problem solving. 
The problem is to learn the concept, or to acquire the ability to correctly identify 
examples of the concept. So we are dealing essentially with a simple form of 
problem solving when we discuss concept formation. Later in this chapter we 
shall turn to more complicated problems. First we shall deal with simple concepts, 
then complex concepts, and then the issues usually discussed in connection with 
the broader area of problem solving. 

A Sample Experiment 

Let us begin our study of concepts by considering a “typical” experimental 
design. 

1. Reed (1972) taught subjects concepts based on schematic faces similar to 
those in Figure 12.1; the attributes of these faces are face shape (narrow or wide), 
eye position (high or low), and mouth shape (smile or frown). The stimuli for the 
experiment are defined by selecting some number of attributes—ways in which 
the stimuli can differ—and constructing all the possible stimuli from selected 
values of these attributes. 

2. The experimenter defines the concept to be learned by choosing one or more
of the attribute values, perhaps a wide face. All stimuli that have wide faces are
then instances of the concept to be learned. The other attributes (eye
position and mouth shape) do not affect category membership and are called
irrelevant attributes. (Note that the experimenter could also define as a concept a
complex combination of the attributes, perhaps a narrow face with both a smile
and high eyes.)

3. Subjects are shown the stimuli in some order, usually random, and classify
each stimulus either as being or not being an instance of the concept. Feedback is
generally given after each trial. Stimuli are shown to the subjects until some
criterion of performance is met, usually some number of consecutive correct
responses. By classifying many stimuli correctly, the subjects are assumed to have
learned the concept.

Figure 12.1 Face stimuli similar to those used by Reed (1972) in studying concept
formation. The faces vary on the dimensions of face shape (wide or narrow), eye position
(high or low), and mouth shape (smile or frown).

This simple experimental design can be very useful in exploring many different 
aspects of concept formation. For example, we might be interested in knowing 
how the number of attributes affects concept learning. So we would have one 
group of subjects learn the concept wide face while we simultaneously varied only 
one other attribute (for example, eye position); another group would learn the 
same wide face concept while we varied two or more other attributes (for 
example, eye position and mouth shape). Presumably we would find that the 
concept would be easier to learn when fewer irrelevant attributes were varied; the 
situation would be simpler and easier to “figure out.” 
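
The construction of stimuli and the definition of a concept can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not Reed's (1972) actual procedure: the attribute names, the helper functions `all_stimuli` and `is_instance`, and the choice of wide face as the concept are hypothetical and simply follow the running example.

```python
from itertools import product

# Attributes and their values, following the schematic-face example.
ATTRIBUTES = {
    "face": ("wide", "narrow"),
    "eyes": ("high", "low"),
    "mouth": ("smile", "frown"),
}

def all_stimuli(attributes):
    """Build every stimulus from all combinations of the attribute values."""
    names = list(attributes)
    return [dict(zip(names, values)) for values in product(*attributes.values())]

def is_instance(stimulus):
    """Hypothetical concept chosen by the experimenter: any wide face."""
    return stimulus["face"] == "wide"

stimuli = all_stimuli(ATTRIBUTES)
instances = [s for s in stimuli if is_instance(s)]
print(len(stimuli), len(instances))  # 8 stimuli, 4 of them instances
```

Three two-valued attributes yield eight faces, and half of them are instances of wide face regardless of their eye position or mouth shape, which is what makes those two attributes irrelevant here.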

DISCOVERING ATTRIBUTES AND RULES

Concept-formation tasks often require us to make two kinds of discovery. First,
we must decide which attributes are relevant, and second, we must discover the
rules that tell us how to combine these attributes. As we have seen, attributes are
ways that stimuli can differ from one another, such as size, shape, and color.

Figure 12.2 An attribute discovery task. Before you begin, place a piece of paper so that
it hides the correct answers on the right-hand side of the page. Looking only at the
figures on the left-hand side, say yes if you believe that the figure is an example of the
concept and no if you believe that it is not. Only one attribute is relevant in this task.
After judging each pair, move the paper down to see the answer. Continue until you think
you know what the concept is, and then check below to see whether you are correct.
(From Cognition by Margaret Matlin. Copyright © 1983 by CBS College Publishing.
Reprinted by permission of Holt, Rinehart and Winston, CBS College Publishing.)

Answers: 1. Yes; 2. Yes; 3. No; 4. No; 5. No; 6. Yes; 7. No; 8. Yes

Answer: The concept is rounded, rather than sharp-angled, eyebrows.

Rules tell us how these attributes are related to one another. For example, 
suppose you found that smiling, wide faces and frowning, narrow faces were 
correct stimuli in Figure 12.1. By coming to realize that these stimuli were correct 
you would have discovered that face shape and mouth shape were the relevant 
attributes, whereas eye position was irrelevant. But you would also have learned a 
rule telling you how to combine the attributes to identify instances: “Only 
frowning narrow faces and smiling wide faces are correct.” 
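
The distinction between relevant and irrelevant attributes can be made concrete with a small Python sketch. It assumes the rule just described (smiling wide faces and frowning narrow faces) and treats an attribute as relevant if changing it alone ever changes category membership. The names `concept_rule` and `relevant_attributes` are hypothetical helpers introduced only for this illustration.

```python
from itertools import product

VALUES = {
    "face": ("wide", "narrow"),
    "eyes": ("high", "low"),
    "mouth": ("smile", "frown"),
}

def concept_rule(stimulus):
    """The rule from the text: smiling wide faces and frowning narrow faces."""
    pair = (stimulus["face"], stimulus["mouth"])
    return pair in {("wide", "smile"), ("narrow", "frown")}

def relevant_attributes(rule, values):
    """An attribute is relevant if changing it alone ever changes membership."""
    names = list(values)
    stimuli = [dict(zip(names, combo)) for combo in product(*values.values())]
    relevant = set()
    for stimulus in stimuli:
        for name in names:
            for other in values[name]:
                changed = {**stimulus, name: other}  # alter one attribute only
                if rule(changed) != rule(stimulus):
                    relevant.add(name)
    return relevant

print(sorted(relevant_attributes(concept_rule, VALUES)))  # ['face', 'mouth']
```

Eye position never changes the outcome, so it does not appear in the result. The rule itself is the separate piece of knowledge that says how face shape and mouth shape must be combined.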

Interestingly, any given experiment can involve attribute discovery alone, rule 
learning alone, or both. Let us look at each of these subprocesses individually. 

Attribute Learning 

Matlin (1983) provides a nice example of an attribute-discovery task where no
rule learning is required. Try the demonstration in Figure 12.2 before reading any
further. As you will realize, eyebrow shape is the relevant attribute (rounded
rather than peaked eyebrows are instances of the concept). At the same time you
will realize that this task did not require you to learn a rule relating the attributes
to one another.


Rule Learning

Matlin (1983) also provides a good example of a concept-formation task that
requires you to learn a rule when the relevant attributes are provided. Try the
demonstration in Figure 12.3 before reading further. You will experience what we
mean by rule learning in the absence of attribute discovery.

Figure 12.3 A rule-discovery task. Before you begin, place a piece of paper so that it
hides the correct answers on the right-hand side of the page. Looking only at the figures
on the left-hand side, say yes if you believe that the figure is an example of the concept
and no if you believe that it is not. The attributes presence or absence of chimney and
presence or absence of window are relevant. Your task is to figure out the rule for
combining these attributes so that you can decide which figures are examples of the
concept. After judging each item, move the paper to see the answer. Continue until you
think you know what the concept is, and then check to see whether you are correct.
(From Cognition by Margaret Matlin. Copyright © 1983 by CBS College Publishing.
Reprinted by permission of Holt, Rinehart and Winston, CBS College Publishing.)

Answers: Yes; Yes; No

Answer: Conditional rule: if a figure has a chimney, then it must have a window.

Combined Learning

Obviously, many, perhaps most, concept-formation tasks require us to discover 
relevant attributes and rules relating those attributes. Go back to the task in 
Figure 12.3, for example, and imagine trying to solve it if you had not first been 
told that chimneys and windows were the relevant attributes. The task would 
have been very much more difficult. 

Rule Complexity 

Thinking about concepts in terms of rules seems to have helped psychologists (see 
Anglin, 1977; Fodor, 1975; McCloskey & Glucksberg, 1978; Rosch, 1978). They 
feel comfortable with this kind of language. One of the obvious things that they 
have considered is the fact that concept-defining rules can vary enormously in 
terms of their complexity. Psychologists want to know whether this is important. 
Are concepts that are defined by simple rules any easier to learn than concepts 
defined by more complex rules? To answer this question imagine that you are a 
subject in an experiment. You are shown a series of cards, each of which contains 
one circle and one square. The circles and squares may be either red or green. For 
each card you must choose the item that is “correct,” or belongs to the category 
that the psychologist has in mind. Now the psychologist may have a very simple 
concept in mind, such as anything that is red. In this case you will quickly learn 
the concept and always choose red items, whether circles or squares. 

But the psychologist can make the task more difficult. For instance, she might 
define the concept you are after as all green circles. In this situation you would 
have to learn to pick only green circles and to refrain from choosing green 
squares, red squares, and red circles. Finally, the psychologist could complicate 
the situation even more by defining the concept as red and not square, or square 
and not red. This rule demands that you choose only red circles and green 
squares. 

Attempting to determine whether concepts defined by complex rules are harder 
to master than concepts defined by simple rules, Haygood and Bourne (1965) 
developed a series of rules that increase in logical complexity (see also Bruner, 
Goodnow, & Austin, 1956; Neisser & Weene, 1962). These concept-defining 
rules are listed in Table 12.1. Complexity increases as one moves down the table 
from the top (on either the left or right side—take your pick). The researchers 
predicted, reasonably enough, that concept learning would become more difficult 
as rule complexity increased, and, in fact, this is essentially what they found. 
Concepts defined by the rules at the bottom of Table 12.1 were more difficult to 
master than concepts higher up in the table. 
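
The rules in Table 12.1 can be written out as simple truth functions, which makes it easy to see which items each rule admits. The sketch below is an illustration only: it assumes the standard logical reading of each rule name, uses the red/green, circle/square items from the example above, and introduces the hypothetical helpers `complement` and `instances`. It is not the procedure used by Haygood and Bourne (1965).

```python
# R = "item is red", S = "item is square"; both are booleans.
BASIC_RULES = {
    "affirmation": lambda r, s: r,
    "conjunction": lambda r, s: r and s,
    "inclusive disjunction": lambda r, s: r or s,
    "conditional": lambda r, s: (not r) or s,
    "biconditional": lambda r, s: r == s,
}

def complement(rule):
    """Each complementary rule admits exactly the items its basic rule rejects."""
    return lambda r, s: not rule(r, s)

# The four kinds of item, as (is_red, is_square) pairs.
ITEMS = {
    "red square": (True, True),
    "red circle": (True, False),
    "green square": (False, True),
    "green circle": (False, False),
}

def instances(rule):
    """List the items that count as examples under the given rule."""
    return [name for name, (r, s) in ITEMS.items() if rule(r, s)]

for name, rule in BASIC_RULES.items():
    print(f"{name:22} {instances(rule)}")
    print(f"{'  complement':22} {instances(complement(rule))}")
```

The complement of the biconditional is exclusive disjunction, which admits only red circles and green squares. This is the "red and not square, or square and not red" concept described in the text.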

INTERPRETATIONS OF CONCEPT FORMATION 

Psychologists have always been interested in how concept learning occurs. In this 
chapter we shall touch on several distinct interpretations of concept formation. 

The Early S-R Interpretation

For the moment we want to focus upon one of the very earliest interpretations.
Specifically, the behaviorists, including Hull, felt that concept learning could be
best understood as an instance of discrimination learning. Hull and others
noticed a similarity between the concept-learning situation and the
discrimination-learning situation. In a discrimination-learning task, if you will recall, a
pigeon might have to learn to discriminate among various colored lights, learning
to respond to only one of them. Similarly, in the concept-learning situation the
human must learn to discriminate among a number of different stimulus values,
learning to respond to only one of them.

Table 12.1 Rule-Defining Concepts of Increasing Logical Complexity*

BASIC RULES
NAME                   SYMBOLIC DESCRIPTION  VERBAL DESCRIPTION
Affirmation            R                     Anything that is red is an example.
Conjunction            R ∩ S                 All red squares.
Inclusive disjunction  R ∪ S                 All items either red or square or both.
Conditional            R → S                 Everything green; also, if red, item must be square.
Biconditional          R ↔ S                 Red items, if they are square; square items if red.

COMPLEMENTARY RULES
NAME                   SYMBOLIC DESCRIPTION  VERBAL DESCRIPTION
Negation               R̄                     Anything not red is an example.
Alternative denial     R̄ ∪ S̄                 All items either not red or not square.
Joint denial           R̄ ∩ S̄                 All items neither red nor square.
Exclusion              R ∩ S̄                 All items red and not square.
Exclusive disjunction  R ∪̄ S                 All items are either red or square but not both.

*As one moves down the table (on either the left or right side), the rules become more complex. In general, the concepts defined by
these rules are more difficult to master.

Adapted from Haygood, R. C., and Bourne, L. E. Attribute- and rule-learning aspects of conceptual behavior. Psychological Review,
1965, 72, 175-195. Fig. 1, p. 178. Copyright 1965 by the American Psychological Association. Reprinted by permission.

This behaviorist, or stimulus-response (S-R), view was supported by one of the
classic experiments in concept learning, carried out by Hull (1920). In this
experiment subjects were trained to classify Chinese characters into
different categories, depending on the presence of a certain “radical,” or line
segment, used in the construction of the character (Figure 12.4). Although many
of Hull’s subjects were able to learn the task, they were unable to explain how
they were classifying the characters. Hull was very impressed by this inability of
subjects to describe their performance, and he stressed the importance of
analyzing the subjects’ behavior and not their introspections.

Hull’s experiment fit quite nicely into the behaviorist view that was developing 
at the time; not only was introspection unscientific, but the subjects could not 
even do it! This kind of result reinforced the behaviorists’ notion that we should 
look only at observable behavior and avoid speculations about what goes on 
“inside the head.” 


The Hypothesis-Testing Interpretation

Beginning in the 1950s, however, psychologists began to depart from the strict
behaviorist position. They began asking questions about what complex mental
processes are involved in concept formation. This movement toward modern
cognitive psychology brought with it a resurgence of interest in introspection,
tempered by the careful evaluation of these introspections against experimental
data. Out of this movement arose the idea that concept formation involves
hypothesis testing, something that Hull would never have considered because it
involves thinking about what goes on in the mind. The hypothesis-testing
interpretation has pretty well supplanted the older discrimination-learning
interpretation, and it remains very influential today.

Figure 12.4 Some of the materials used by Hull in demonstrating concept formation.
Each of the six different “radicals,” or concepts, is imbedded in all of the characters in its
row. The subjects first were given the characters in List 1 and asked to associate a
different nonsense syllable with each one. Then they were given the second list and asked
to again associate the same nonsense syllables with these new characters. In the second
list the nonsense syllable was paired with the second-list character, which contained the
same radical that appeared in the first-list character paired with that syllable. Then they
learned the third list, again being required to associate the nonsense syllables with the
characters that contained the “correct” radicals. By the time the sixth list was reached,
the subjects were very good at giving the correct nonsense syllables on the very first
presentation of the characters, even though they had little idea of how they could
do it. (Adapted from Hull, C. L. Quantitative aspects of the evolution of concepts.
Psychological Monographs, Whole No. 123, 1920. Copyright 1920 by the American
Psychological Association.)

Instead of viewing concept learning as a somewhat more advanced case of 
discrimination learning, many psychologists see concept learning as a process of 
testing hypotheses—guesses about what the true classification rule might be. The 
subject is assumed to select one of the hypotheses and use it to classify the stimuli. 
For example, consider the stimuli in Figure 12.1. There are six simple hypotheses: 
wide face, narrow face, high eyes, low eyes, smiling, and frowning. Subjects 
might begin an experiment by guessing that the hypothesis wide face will
correctly classify the stimuli. If the rule that correctly defines the concept is smile, they
would respond to the stimuli as shown in Figure 12.5.

Figure 12.5 A sample set of stimuli, subject responses to those stimuli based on the
hypothesis that wide is the concept, and feedback given to the subject on each of three
trials. The correct hypothesis is smile, not wide.

On Trials 1 and 2 wide face seems to work. This, however, is only because thus 
far wide face has appeared only with smile and has not appeared without smile. 
In Trial 3, when wide face and frown appear together, the subject’s hypothesis 
leads to an incorrect response. After Trial 3 the subject knows that wide face is 
not the correct hypothesis and selects a new hypothesis to replace the old, 
incorrect one. This, too, may be incorrect, and yet another hypothesis will have to 
be tried. This process will continue until the subject finally selects the proper 
hypothesis (smile) and makes a long string of correct responses, terminating the
experiment. 
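
The hypothesis-testing account can be simulated with a short Python sketch. It makes several simplifying assumptions that are not in the text: stimuli are presented in a fixed repeating order rather than randomly (so that the run is reproducible), a disconfirmed hypothesis is never tried again, and the criterion is one correct pass through all eight faces. The function `learn` and its parameters are hypothetical names used only for illustration.

```python
from itertools import cycle, product

VALUES = {"face": ("wide", "narrow"), "eyes": ("high", "low"), "mouth": ("smile", "frown")}
STIMULI = [dict(zip(VALUES, combo)) for combo in product(*VALUES.values())]

# The six simple hypotheses, each an (attribute, value) pair.
HYPOTHESES = [(name, value) for name, values in VALUES.items() for value in values]

def learn(correct, first_guess, criterion=len(STIMULI)):
    """Keep a hypothesis while it works; replace it after an error."""
    remaining = [h for h in HYPOTHESES if h != first_guess]
    current, streak, trials = first_guess, 0, 0
    for stimulus in cycle(STIMULI):
        trials += 1
        response = stimulus[current[0]] == current[1]  # subject's classification
        feedback = stimulus[correct[0]] == correct[1]  # experimenter's answer
        if response == feedback:
            streak += 1
            if streak == criterion:
                return current, trials
        else:
            current, streak = remaining.pop(0), 0      # select a new hypothesis

final, trials = learn(correct=("mouth", "smile"), first_guess=("face", "wide"))
print(final, trials)
```

Because any run of eight consecutive trials covers every face once, only a hypothesis that classifies all of the stimuli correctly can reach the criterion. The simulated subject therefore always ends on smile, whatever the first guess.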

CONSERVATIVE FOCUSING. In a classic paper Bruner, Goodnow, and Austin 
(1956) reported a series of experiments that described hypothesis-testing proce¬ 
dures in great detail. Although no specific theory was presented to account for the 
results, Bruner and his co-workers isolated many of the critical questions that any 
theory of concept learning must confront, and they described many methods used 
by their subjects to test hypotheses (see also Johnson, 1978). One of these 
methods is particularly interesting, because it represents the optimal strategy for 
concept learning. Bruner, Goodnow, and Austin used a slightly different technique
from the one described earlier. Rather than being shown the stimuli in
random order, the subjects told the experimenter which stimulus they wanted to
have classified: “Is the wide, low-eye, smiling face an instance of the concept?”
With this ability to select stimuli for classification, the subjects used a strategy
called conservative focusing. In conservative focusing the subject takes a very 
simple, direct approach and focuses on a single attribute at a time, checking it out 
completely before moving on to another dimension—hence, the label. 

For instance, consider the example cited above involving faces. Suppose that 
the correct hypothesis is smile. A subject begins his task by selecting and focusing 
on one guess, or hypothesis. Let us say that he guesses that wide face is correct. 
He begins by asking whether wide face—low eyes—smile is an instance of the 
correct category. He will be told that it is. Then, because he is guessing that wide
face is correct, he changes wide face to narrow face, leaving the other two
attributes alone. Specifically, he asks about narrow face—low eyes—smile. Again
he is told that it is an instance. So much for his first guess that wide face is the
concept. He now knows that wide face cannot be the concept, because if it were, 
narrow face—low eyes—smile would not belong to the correct category. So he 
changes his focus. Having eliminated the wide-narrow attribute, he turns to one
of the two remaining attributes. Let us say that he guesses that smile is correct. To 
test this guess he varies that single attribute, holding the other two constant, and 
asks about narrow face—low eyes—frown. He is told that this is not an instance
of the category, and he has the answer! Smile must be the concept, because 
instances with it included are correct and instances without it are incorrect. He 
has found the concept by focusing on and eliminating one attribute at a time. 
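
The procedure in this example can be written out as a short routine. The following is a minimal sketch under stated assumptions: the dictionary encoding of a face, the OPPOSITE table, the function name, and the oracle function standing in for the experimenter are all hypothetical, and the sketch handles only concepts defined by a single attribute value. As in the example, each probe changes one attribute of the most recent positive instance.

```python
# Illustrative sketch only: names and stimulus encoding are assumptions.

OPPOSITE = {
    "wide": "narrow", "narrow": "wide",
    "high": "low", "low": "high",
    "smile": "frown", "frown": "smile",
}


def conservative_focus(positive_instance, dimension_order, is_instance):
    """Vary one attribute at a time, starting from a known positive instance.

    If flipping an attribute still yields a positive instance, that attribute
    is irrelevant. If it yields a negative instance, the original value of
    that attribute is the concept.
    """
    focus = dict(positive_instance)
    queries = []
    for dimension in dimension_order:
        probe = dict(focus)
        probe[dimension] = OPPOSITE[focus[dimension]]
        queries.append(probe)
        if is_instance(probe):
            focus = probe  # attribute is irrelevant; continue from this instance
        else:
            return (dimension, focus[dimension]), queries
    return None, queries  # no single attribute value defines the concept


def experimenter(stimulus):
    """Stand-in for the experimenter; the correct concept here is smile."""
    return stimulus["mouth"] == "smile"


start = {"face": "wide", "eyes": "low", "mouth": "smile"}
concept, queries = conservative_focus(start, ["face", "mouth", "eyes"], experimenter)
print(concept)
print(queries)
```

Starting from wide face, low eyes, smile and testing the face dimension first, the sketch asks the same two questions as the subject in the example and then stops.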

If the subject adopts this strategy in an experiment such as Bruner, Goodnow, 
and Austin’s, very good performance is guaranteed. These conditions, however, 
are not typical of much of the work in concept learning. If category membership 
depends on two attribute values (say, a wide, smiling face), conservative focusing
will not work. In addition, most experiments use random presentation of
stimuli, so that some time may pass before a stimulus is presented that will allow 
the use of conservative focusing. By that time the subject may have forgotten the 
stimulus that was originally selected or, perhaps, which of the attributes had been 
shown to be irrelevant to category membership. Performance in most concept-learning
experiments is therefore well below the ideal specified by
conservative focusing. Conservative focusing is an optimal strategy only when the 
subject can select the stimuli to be evaluated and when a single dimension is 
involved. 

“WATCHING” PEOPLE USE HYPOTHESES: BLANK TRIALS. Levine (1966, 1975) devised
an ingenious experimental method that enables us to do what all previous
research had failed to do. Specifically, Levine’s design enables us not only to tell
whether a subject is using a hypothesis but also to determine exactly what that
hypothesis is and when the subject changes from that hypothesis to a new one.

Subjects were shown a series of cards, each of which contained two letters. The 
subjects’ task was to choose one of the two letters on each card. The letters 
differed in color (black or white), letter (X or T), size (large or small), and 
position (left or right side of card) (see Figure 12.6). The correct hypothesis was 
one of the eight simple possibilities (small, large, black, white, T, X, left, right). 
On the first trial the subjects were shown a card and told which of the two letters 
was correct. But then Levine did something a little different. Subjects were given a 
series of four “blank trials” during which they had to choose one of the letters, 
but no feedback was given during these blank trials. The assumption was that,
although subjects might change their hypothesis following a trial with feedback,
they would not change during the blank trials because no information about the
correctness of their responses was given. Furthermore, these blank trials were
specially designed so that, if the subject was using a specific hypothesis, that
hypothesis could be accurately detected by the pattern of responses that he or she
gave over the four blank trials.

Figure 12.6 The eight response patterns in Levine’s (1966) experiment and the
hypotheses indicated by those patterns. (Adapted from Levine, M. Hypothesis behavior
by humans during discrimination learning. Journal of Experimental Psychology, 1966,
71, 331-338. Fig. 1, p. 331. Copyright 1966 by the American Psychological Association.
Reprinted by permission.)

The specially selected stimuli that were presented during the four blank trials 
are shown in Figure 12.6. Notice that the patterns of responses for the eight 
different hypotheses are all different. By matching the subject’s actual responses 
against these patterns we can determine which of the eight hypotheses the subject 
was actually using. For example, if a subject gave T T T T over the four trials, we 
can be sure that he was guessing that T was the correct hypothesis. Similarly, if a 
subject responded with X T X T, we can be sure she was using the hypothesis 
large. In addition, if a subject gave a pattern of responses other than those in 
Figure 12.6, we can be sure that none of the eight simple hypotheses was being 
used consistently in all four blank trials. This is an ingenious way
to determine what people are guessing without having to ask them. 
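
The matching of responses against patterns can be sketched in code. The four cards below are hypothetical: they are not Levine's actual stimuli, and were chosen only so that the eight simple hypotheses produce eight different response patterns and so that the patterns agree with the examples given above (T T T T for T, X T X T for large). The names BLANK_TRIAL_CARDS, pattern_for, and decode are likewise assumptions made for illustration.

```python
# Illustrative sketch only: these four cards are hypothetical, not the
# stimuli Levine used. Each card shows a T and an X that differ in size,
# color, and position.

BLANK_TRIAL_CARDS = [
    {"T": {"size": "small", "color": "white", "position": "right"},
     "X": {"size": "large", "color": "black", "position": "left"}},
    {"T": {"size": "large", "color": "white", "position": "left"},
     "X": {"size": "small", "color": "black", "position": "right"}},
    {"T": {"size": "small", "color": "black", "position": "left"},
     "X": {"size": "large", "color": "white", "position": "right"}},
    {"T": {"size": "large", "color": "black", "position": "right"},
     "X": {"size": "small", "color": "white", "position": "left"}},
]

HYPOTHESES = ["T", "X", "large", "small", "black", "white", "left", "right"]


def chosen_letter(card, hypothesis):
    """Return the letter a subject holding this hypothesis would choose."""
    if hypothesis in card:  # the hypothesis is itself a letter (T or X)
        return hypothesis
    for letter, attributes in card.items():
        if hypothesis in attributes.values():
            return letter
    raise ValueError("unknown hypothesis: " + hypothesis)


def pattern_for(hypothesis):
    """Return the four-letter response pattern implied by a hypothesis."""
    return "".join(chosen_letter(card, hypothesis) for card in BLANK_TRIAL_CARDS)


# Map each response pattern back to the hypothesis that produces it.
PATTERNS = {pattern_for(h): h for h in HYPOTHESES}


def decode(responses):
    """Return the hypothesis matching the responses, or None if none does."""
    return PATTERNS.get(responses)


print(decode("TTTT"))
print(decode("XTXT"))
print(decode("TTTX"))
```

A pattern such as T T T X matches none of the eight entries, which corresponds to the case in which no simple hypothesis was used consistently over the four blank trials.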

What did Levine learn using the “blank trial” method? First, he found that in 
over 90% of the blank trial series one or another of the eight hypotheses was 
being used. In other words, people did not just skip around trying any old thing 
on any given trial. They did, after all, pick one hypothesis and stick to it through 
the four blank trials. They did not switch from one hypothesis to another during 
the blank series. Second, Levine found that subjects almost always stuck with that
same hypothesis on the feedback trial following the four blank trials; they held to
their “game plan.” Third, by looking
at performance on successive series of four blank trials, interspersed among 
feedback trials, it was possible to note when subjects changed hypotheses and 
what new hypotheses they changed to. For example, if a subject gave T T T T on 
one set of four blank trials and then, after another feedback trial, gave T X T X on 
another series of four blank trials, it can be inferred that the subject shifted from 
testing the hypothesis that T was correct to testing the hypothesis that small was 
correct. Fourth, Levine was able to show that, as the experiment progressed, 
subjects eliminated incorrect hypotheses and no longer considered them. When 
they tried a hypothesis and found it not to work, they did not put it back in the 
“pool” of available hypotheses and did not test it again later. Although they were 
not perfect at doing this, they remembered which hypotheses did not work, and 
they did not, for the most part, waste time in testing them again. In summary, 
Levine’s work suggests very strongly that hypothesis testing plays an important 
part in concept learning and that humans are fairly methodical and careful in 
their hypothesis-testing efforts. 

Natural Categories 

Many psychologists have argued that the typical concept-formation tasks used in 
our laboratories are very artificial and quite unrelated to the concepts we develop 
and utilize outside the confines of the laboratory. Eleanor Rosch, at the University 
of California, Berkeley, was one of the first to raise and explore this criticism 
leveled at laboratory concept tasks (Rosch, 1973). She and others point out that 
by studying such artificial concepts as red squares or wide frowns we are learning 
very little about the complex concepts we actually encounter in everyday life. 

There are two main criticisms of the laboratory approach. First, natural categories
are hierarchical in the sense that larger categories often contain smaller
categories. For instance, bees are insects, which are, in turn, animals. Most early 
laboratory experiments did not take these critical hierarchical relationships into 
account. Second, laboratory work failed to account for the fact that some instances
of a category are more representative of that category than are
other instances (Medin, Dewey, & Murphy, 1983; Roth & Mervis, 1983). For 
example, robin is more representative of the category bird than is penguin. In 
traditional laboratory tasks this important fact was not taken into account. The 
experimenter arranged the materials and tasks without regard for these variations 
in representativeness. 

There are other criticisms of the traditional lab approach. Barsalou (1983)
points out that, to truly understand concepts, we must acknowledge that
certain categories, such as things to sell at a garage sale, are useful and are used by
us but are not as well established in memory as a concept like fruit. But
the two criticisms we have mentioned (hierarchical structure and variations in the 
representativeness of instances) deserve further comment individually. 

HIERARCHICAL ORGANIZATION. Rosch, Mervis, Gray, Johnson, and Boyes-Braem
(1976) argued that the hierarchical nature of naturally occurring categories
can best be understood by referring to the three levels depicted in Table 12.2.
These levels reflect the fact that any given object can belong to several different
but related categories. For example, you can refer to what you play as your
musical instrument, your guitar, or your folk guitar. Or you can refer to what you
drive as your vehicle, your truck, or your pickup truck.

Table 12.2 Examples of Subordinate, Basic, and Superordinate Categories

SUPERORDINATE       BASIC LEVEL  SUBORDINATES
Musical instrument  Guitar       Folk guitar           Classical guitar
                    Piano        Grand piano           Upright piano
                    Drum         Kettle drum           Bass drum
Fruit               Apple        Delicious apple       Mackintosh apple
                    Peach        Freestone peach       Cling peach
                    Grapes       Concord grapes        Green seedless grapes
Tool                Hammer       Ball-peen hammer      Claw hammer
                    Saw          Hack hand saw         Cross-cutting hand saw
                    Screwdriver  Phillips screwdriver  Regular screwdriver
Clothing            Pants        Levi’s                Double-knit pants
                    Socks        Knee socks            Ankle socks
                    Shirt        Dress shirt           Knit shirt
Furniture           Table        Kitchen table         Dining-room table
                    Lamp         Floor lamp            Desk lamp
                    Chair        Kitchen chair         Living-room chair
Vehicle             Car          Sports car            Four-door sedan car
                    Bus          City bus              Cross-country bus
                    Truck        Pickup truck          Tractor-trailer truck

From Rosch, E. H., Mervis, C. B., Gray, W. D., Johnson, D. M., and Boyes-Braem, P. Basic objects in natural categories. Cognitive
Psychology, 1976, 8, 382-440. Copyright 1976 by Academic Press, Inc. Reprinted by permission.

Superordinate category levels are the most general. Fruit, tools, and clothing 
are examples of superordinate categories. Subordinate category levels are the 
most specific, including examples such as Delicious apples, ball-peen hammer, 
and Levi’s. But notice that the intermediate category levels, containing examples
such as apple, hammer, and pants, are called basic-level categories by Rosch and
her colleagues. They feel that this category level is the most important of the 
three—hence, the label basic level. They believe that it is at this level that 
categories carry the most information and are the most different from one 
another. This is the first of the three types of category we learn and is the most 
important in language (Reed, 1982). 

Rosch, Mervis, Gray, Johnson, and Boyes-Braem (1976) identified some of the 
characteristics that make basic-level categories important: 

1. Common attributes. Members of a given basic-level category have many 
attributes in common. For example, consider the basic-level category pants. You 
can think of a lot of attributes that all pants have in common, including legs, belt 
loops, waist, cloth, pockets, and so on. But now try to think of attributes that all 
instances of the corresponding superordinate category clothing have in common. 
It’s not so easy, is it? And finally, try to think of attributes that all instances of the 
subordinate category Levi’s have in common, above and beyond what is already 
included in the basic-level category pants. The point here is that most of the 
information is contained in the basic level. 

2. Common motor movements. Instances of basic-level categories seem to 
have many motor movements in common. For example, we all put our pants on 
the same way, regardless of what kind of pants they are. Instances of superordinate
categories have few movements in common. Specifically, can you think of
any movements that are common to all examples of clothing? And adding a
subordinate category doesn’t add much new information about motor movements
that is not already contained in the basic-level category (that is, we put on
Levi’s the same way we put on our double-knits). The point is that there is a big 
increase in common motor movements as we move from superordinate to basic 
but a very small increase, if any at all, as we move from basic to subordinate. 

3. Common shapes. Instances of superordinate categories do not all have similar
shapes. For example, all examples of clothing do not look alike. But at the
basic level there is a great deal of overlap in the shapes of all samples (for 
example, most pants look quite a bit alike). Examples at the subordinate level also 
show considerable overlap in shape, but not much more than at the basic level. 
Once again, there is a big increase in common shape when we move from 
superordinate (clothing) to basic (pants) but not much further common shape as
we move to subordinate (Levi’s). 

4. Object identification. What do you call what you are sitting on right now? 
You call it a chair. You don’t call it a piece of furniture, and you don’t call it a
library chair or a wingback chair. Rosch and her co-workers found that people 
prefer to identify an object by its basic-level name. 

In summary, basic-level categories seem to be the most important of the three 
levels because examples at this level have many common attributes, have common 
associated motor movements, have common shapes, and tend to be used in 
identifying objects. Rosch’s analysis of the complexity of the hierarchical relationships
among categories is part and parcel of the new look in concept research,
which emphasizes natural rather than artificially defined concepts. 

FAMILY RESEMBLANCES. Recall that the second difference between the old
laboratory approach and the new “natural” approach to the study of concepts
is that the former does not account for the fact that
instances of a given category may be more or less representative of that category.
For example, the concept vehicle is well represented by car but not well represented
by elevator. Rosch and Mervis (1975) use the phrase family resemblance to
refer to how well instances of a category represent that category. Table 12.3 
contains 20 members of six categories that have been ranked by subjects for their 
typicality, with Item 1 being the most typical. That these differences exist isn’t so 
surprising; in fact, the rankings in Table 12.3 are much as we might expect. The 
interesting question is why these differences exist. Why is knife more typical of
the category weapon than is scissors? By asking subjects to list attributes of the
items in Table 12.3 and by then comparing attributes, Rosch and Mervis were 
able to find support for the following hypotheses: 

1. Good members of a category share many attributes with other members of 
that category. For example, car has more in common with the other 19 
instances of vehicle than does elevator. 

2. Good members of a category share few attributes with members of other
categories. For example, rocket (a poor example of weapon) is more likely to
have attributes in common with vehicle than is gun (a good example of
weapon).

Table 12.3 Typicality of Members in Six Superordinate Categories

      CATEGORY
ITEM  FURNITURE  VEHICLE      FRUIT        WEAPON         VEGETABLE         CLOTHING
1     Chair      Car          Orange       Gun            Peas              Pants
2     Sofa       Truck        Apple        Knife          Carrots           Shirt
3     Table      Bus          Banana       Sword          String beans      Dress
4     Dresser    Motorcycle   Peach        Bomb           Spinach           Skirt
5     Desk       Train        Pear         Hand grenade   Broccoli          Jacket
6     Bed        Trolley car  Apricot      Spear          Asparagus         Coat
7     Bookcase   Bicycle      Plum         Cannon         Corn              Sweater
8     Footstool  Airplane     Grapes       Bow and arrow  Cauliflower       Underpants
9     Lamp       Boat         Strawberry   Club           Brussels sprouts  Socks
10    Piano      Tractor      Grapefruit   Tank           Lettuce           Pajamas
11    Cushion    Cart         Pineapple    Teargas        Beets             Bathing suit
12    Mirror     Wheelchair   Blueberry    Whip           Tomato            Shoes
13    Rug        Tank         Lemon        Icepick        Lima beans        Vest
14    Radio      Raft         Watermelon   Fists          Eggplant          Tie
15    Stove      Sled         Honeydew     Rocket         Onion             Mittens
16    Clock      Horse        Pomegranate  Poison         Potato            Hat
17    Picture    Blimp        Date         Scissors       Yam               Apron
18    Closet     Skates       Coconut      Words          Mushroom          Purse
19    Vase       Wheelbarrow  Tomato       Foot           Pumpkin           Wrist watch
20    Telephone  Elevator     Olive        Screwdriver    Rice              Necklace

From Rosch, E. H., and Mervis, C. B. Family resemblances: Studies in the internal structure of categories. Cognitive Psychology,
1975, 7, 573-605. Copyright 1975 by Academic Press, Inc. Reprinted by permission.
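
The first of these hypotheses can be made concrete with a small calculation. The following is a minimal sketch of one way to score how much an item has in common with the other members of its category: each attribute is weighted by the number of category members that possess it, and an item's score is the sum of the weights of its attributes. The attribute lists are invented for illustration and are not data from Rosch and Mervis; the function name is also an assumption.

```python
# Illustrative sketch only: the attribute lists below are invented, not data
# from the study.
from collections import Counter


def family_resemblance(category):
    """Score each item by how widely its attributes are shared in the category."""
    # Weight of an attribute = number of category members that possess it.
    weights = Counter(attr for attrs in category.values() for attr in attrs)
    return {item: sum(weights[attr] for attr in attrs)
            for item, attrs in category.items()}


vehicle = {
    "car": {"wheels", "engine", "carries people", "steered", "moves on roads"},
    "truck": {"wheels", "engine", "steered", "moves on roads", "carries cargo"},
    "bicycle": {"wheels", "carries people", "steered", "moves on roads"},
    "elevator": {"carries people", "moves vertically"},
}

scores = family_resemblance(vehicle)
for item in sorted(scores, key=scores.get, reverse=True):
    print(item, scores[item])
```

With these invented attributes car receives the highest score and elevator the lowest, which is the ordering the first hypothesis would lead us to expect.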


Prototypes 

How do we go about recognizing novel objects as members of categories? For 
example, how does a young child decide that a newly encountered animal is a cat 
rather than a bird? There are two general classes of model designed to answer this
question: prototype models and feature-frequency models. This section describes 
prototype models, with feature-frequency models to follow. The prototype model 
suggests that we develop a prototype, or a best example, of the category and then 
compare novel patterns with that prototype in order to decide whether that novel 
stimulus belongs to that category (Armstrong, Gleitman, & Gleitman, 1983).

Consider the object in Figure 12.7. Is this more like a rectangle or a triangle? 
Think carefully about what you do as you try to reach a decision. In some sense 
you compare Figure 12.7 with an ideal model, or pattern, of a rectangle, don’t 
you? You do the same sort of thing with a pattern of an ideal triangle, too. These 
internal patterns of perfect triangles and rectangles are what psychologists call 
prototypes. 

Some concepts seem to be more perceptual than logical in nature. In these cases 
learning a concept may be more like developing an idealized, internal “picture,” 
or prototype, than like learning some logical rule that can be stated verbally. Once 
the pattern, or prototype, that defines a given concept is developed (for example, 
an internal picture of an ideal triangle), we can use that prototype to make 
decisions about any stimulus that is given to us. We can take a presented stimulus 
and “hold it up against” the prototype. We decide if a given stimulus is a member
of the category (for example, triangle) by comparing the presented stimulus
with the prototype. If it is similar enough, we include it. If it is too different, we
reject it.

Figure 12.7 An ambiguous concept. Is this figure more like a triangle or a rectangle?

Many concepts are defined better by prototypes than by logical rules. For 
example, consider the concept chicken. If someone showed you a bird that you 
had never seen before and asked you if it was a chicken, how would you go about 
making a decision? Would you start thinking about logical rules that define 
“chickenness”? Or would you somehow compare the overall appearance of the 
animal with your internal picture of an ideal chicken? You would probably use 
the prototype approach. In fact, you probably cannot even think of a logical rule 
that defines “chickenness,” much less use such a rule to make a decision about 
category membership. But we all do have some internal prototype of chicken that 
we can use as a standard against which to judge new gallinaceous individuals. 
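
The idea of holding a stimulus up against a prototype can be sketched in code. This is a minimal illustration under several assumptions that do not come from the research itself: stimuli are described by two made-up numeric features (number of corners and height-to-width ratio), the prototype is taken to be the average of previously seen examples, similarity is measured by straight-line distance, and the cutoff for "similar enough" is an arbitrary threshold.

```python
# Illustrative sketch only: the features, the averaging rule, the distance
# measure, and the threshold are all assumptions.
import math


def build_prototype(examples):
    """Average the feature values of the examples seen so far."""
    count = len(examples)
    return tuple(sum(values) / count for values in zip(*examples))


def distance(a, b):
    """Straight-line distance between two feature tuples."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def classify(stimulus, prototypes, threshold):
    """Return the category with the nearest prototype, or None if too different."""
    name, proto = min(prototypes.items(),
                      key=lambda item: distance(stimulus, item[1]))
    return name if distance(stimulus, proto) <= threshold else None


# Features: (number of corners, height-to-width ratio).
prototypes = {
    "triangle": build_prototype([(3, 1.0), (3, 1.2), (3, 0.8)]),
    "rectangle": build_prototype([(4, 0.5), (4, 0.6), (4, 0.4)]),
}

print(classify((3, 0.9), prototypes, threshold=0.5))
print(classify((4, 0.55), prototypes, threshold=0.5))
print(classify((8, 1.0), prototypes, threshold=0.5))
```

The third stimulus is far from both prototypes, so the sketch rejects it rather than forcing it into the nearer category.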

A good deal of work has been done in connection with the prototype approach 
(see Franks & Bransford, 1971; Lasky & Kallio, 1978; Palmer, 1978; Peterson, 
Meagher, Chait, & Gillie, 1973; Posner, 1970, 1973; Posner, Goldsmith, &
Welton, 1967; Posner & Keele, 1968; Rosch, 1977). This research suggests that 
we probably do form prototypes in connection with a great many concepts, 
especially when these concepts are of a perceptual rather than a logical nature. At 
the same time, however, we may develop and utilize logical rules that describe 
how class members can differ from the prototype but still be members of the 
category (see Lasky & Kallio, 1978). For example, we may have a chicken 
prototype within us somewhere, but at the same time we may also have rules, 
such as “chickens usually have red wattles, but not always,” that we use to 
reconcile differences between the ideal prototype and the new bird that we are 
considering. 

In other words, and this should not surprise anyone, we probably use both 
prototypes and logical rules in deciding category membership. In fact, as noted, 
some feel that there may not be very much difference between rule learning and 
prototype formation. We are clever, and we use a number of strategies and 
techniques to accomplish this task. 

Feature-Frequency Models 

Although comparing a novel stimulus with a prototype is one way to think about
how we make decisions concerning category membership, there are other interpretations
of the process. According to feature-frequency theory, we count up the
number of features, or attributes, that our novel stimulus has in common with
various categories and assign it to the category with which it has the most feature
matches (see Kellogg, 1983; Toppino & Bucher, 1983).
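
The counting rule just described is simple enough to write down directly. The following is a minimal sketch, assuming that each category can be summarized as a set of features; the feature lists for cat and bird and the function names are invented for illustration.

```python
# Illustrative sketch only: the feature lists are invented for illustration.


def feature_matches(stimulus, categories):
    """Count the features the stimulus shares with each category."""
    return {name: len(stimulus & features)
            for name, features in categories.items()}


def classify(stimulus, categories):
    """Assign the stimulus to the category with the most feature matches.

    Ties go to whichever tied category is listed first.
    """
    matches = feature_matches(stimulus, categories)
    return max(matches, key=matches.get)


categories = {
    "cat": {"fur", "whiskers", "four legs", "tail", "meows"},
    "bird": {"feathers", "wings", "two legs", "beak", "tail", "flies"},
}

new_animal = {"fur", "four legs", "tail", "whiskers"}
print(feature_matches(new_animal, categories))
print(classify(new_animal, categories))
```

Unlike the prototype approach, nothing here compares the stimulus with a single best example; the decision rests only on the tally of shared features.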

It appears that both prototype and feature-frequency models can help us 
understand the process of classifying novel stimuli. But, as Reed (1982) points 
out, the situation is fairly complicated. He suggests that one type of model may be 
best with certain kinds of stimuli, whereas other stimuli may be best dealt with in 
terms of the alternative type of model. 

ANIMALS AND CONCEPTS 

Thus far we have been discussing concepts as though they were the property of 
humans alone. This is not true. Clearly, animals can form and use concepts, too. 
They can distinguish between members of their own species and members of 
other species. This ability must involve concept formation, perhaps of the prototype
variety. Cats have a very clear idea of what is, and what is not, a human.
Many dogs seem to have a mail carrier concept. Most animals can discriminate 
easily between edible and inedible foods. Apes can apparently utilize language to 
a degree; this must involve the development and use of concepts. Wolves have 
been known to toss pieces of dried caribou skin, bark, and sticks about in a 
Frisbee-like fashion, suggesting the concept toy. Animals have enormously complicated
communication systems, many of which involve the learning of concepts.
The list goes on and on. In fact, there is little reason to assume that the lives of 
animals are any less filled with concepts than are our own. 

But for the doubter who requires a good, solid laboratory demonstration, here 
is an example of concept formation in the duck. Put a piece of duck food in one of 
three cans. Cover this can with a cardboard triangle. Cover the remaining empty 
cans with cardboard rectangles. Allow the duck to nudge off the pieces of 
cardboard and find and eat the food under the triangle. Continue to cover the 
cans in the same fashion, allowing the duck to discover and eat the food under 
the triangle. But on successive trials vary the size, shape, and color of both the 
rectangles and the triangles. Over a series of trials the duck will come to look 
under all of the widely varying triangles but will never look under the rectangles. 
This demonstrates that the duck has learned, and can utilize, a triangle concept. 
For more complete reviews of concept formation, see Bourne, Dominowski, and 
Loftus (1979) and Donahoe and Wessells (1980). For examples of current interest 
in animal cognition see Essock-Vitale (1978), Hulse, Fowler, and Honig (1978), 
and Pepperwood (1983).


PROBLEM SOLVING 

We now turn away from the topic of concept formation and direct our attention 
to what psychologists have learned about problem solving. Back in Chapter 1 we 
defined a problem as something that exists when a motivated organism is trying 
to reach a goal but is blocked from doing so by an obstacle or obstacles. Problems
come in many shapes and forms, but they all involve the blockage of motive 
satisfaction. For example, problems can be rather simple, involving memory 
search (for example, “What is the longest word you can think of that spells the 
same thing forward and backward?”). Or problems can be very complex (for 
example, “How do we go about maintaining a worldwide balance of power?”). It 
is clear that each of us faces many different kinds of problem every day. Hence, it 
is also clear that a firm understanding of the processes involved in problem
solving would be to our advantage. 

It is not that easy to distinguish among problems, concepts, and cognitive 
activity. We have already seen that some people think of all cognitive activity in 
terms of problem solving. We have also discovered that concept formation can be 
thought of as one type, or form, of problem solving, where the problem is to 
discover the correct concept. 

For our purposes let us make the following distinctions. Cognitive activity is a 
more general term than problem solving, covering a wider range of activities. 
Problem solving is one form of cognitive activity, but not the only one. There are 
types of cognition, such as perceiving, imagining, and remembering, that are not 
easily thought of as problem solving. 

Although this may be an oversimplification, we have also seen that it is helpful 
to think of concept formation as one form of simple problem solving. Basically, 
the kinds of situation that have been studied under the label of problem solving 
tend to be more complex than those studied under the heading of concept 
formation. 

For example, problem-solving tasks typically involve more possible solutions 
than concept-formation tasks. In a concept-formation task, on the one hand, 
there are often only a limited number of possible hypotheses (for example, red, 
black, circle, square), and these are all provided by the experimenter. In problem
solving, on the other hand, the number of possible answers can be almost 
unlimited, and none is provided by the experimenter (for example, words that 
spell the same thing forward and backward).

Furthermore, solving problems often (but not always) requires considerable 
world knowledge that is not explicitly stated in the problem. This is different 
from the typical concept-formation task, in which most of the required informa¬ 
tion is supplied by the experimenter. For example, finding the correct concept 
among red, black, circle, and square does not require the large vocabulary that is 
essential to finding long words that spell the same thing in both directions. 

Given that we know that problem solving is one form of cognitive activity and 
that concept formation is one form of problem solving, we can turn to the 
important interpretations of problem solving, both classical and modern. 

Gestalt Interpretations of Problem Solving

INSIGHT AND TRIAL AND ERROR. One of the classic controversies in the field of
problem solving concerns the issue of insight versus trial and error. Gestalt 
psychologists (see Wertheimer, 1945) argued that the solution to a problem is 
achieved in a “flash of insight.” One minute you do not have the answer, and the 
next moment you do. It is something like a lightbulb being snapped on inside your 
head. The solution to the problem is assumed to be perceived in an all-or-none 
fashion. It is the “Aha! I have it!” experience.

In contrast to this insight process, one can think of problem solving as a process 
of trial and error in which subjects plod along in a somewhat blind fashion, trying 
out different solutions until they stumble on the correct one. There is no “flash of 
insight” in this process; rather, the subjects do not even perceive the solution until 
after it has been discovered through trial-and-error activities. 

In support of their insight interpretation Gestalt psychologists pointed out that
problems are often solved suddenly and permanently. For example, suppose that
a banana is suspended just beyond the reach of a chimpanzee in a cage containing
several scattered boxes. The Gestalt psychologists observed that after several
futile attempts to acquire the banana by reaching, the apes would become quiet for
a moment and then, all of a sudden, stack the boxes, climb on top of them, and
reach the banana, as if they had experienced a flash of insight. Once this solution
was perceived, it tended not to be forgotten; it was permanent.

Figure 12.8 Maier’s (1931) string problem. The strings are to be tied together, but the
subject cannot reach the two of them at the same time.

Another problem that is relevant to the question of insight versus trial and error 
(Maier, 1931) is shown in Figure 12.8. Two strings are hung from the ceiling of a 
room, and a subject is told to tie the ends of the strings together. The problem 
here is that the strings are too short for the subject to pick up one, walk over to 
the other, and tie them together; some other way to grasp both strings at once 
must be found. To help the subject, a number of objects are also in the room, 
including a hammer and a pair of pliers. (Stop here and try to solve the problem 
before going on.) 

This problem can be solved quite simply by tying the hammer (or the pair of 
pliers) to one of the strings, swinging it like a pendulum, grasping the other string, 
catching the “pendulum,” and then tying the strings together. To find this solution,
however, the subject must stop thinking of the hammer as a tool and instead
think of it as an object with mass that is capable of acting like a pendulum. 

Gestalt psychologists observed that solutions in situations such as this one 
often have all the characteristics of the insight experience. The subjects, in fact, 
will often say, “I’ve got it,” and proceed to solve the problem. 

But does this kind of evidence really eliminate trial-and-error processes? Probably 
not. For example, it is entirely possible that the so-called insight experience is 
really nothing more than mental trial and error. As the subjects stand there 
holding one string and looking at the other, they may be “trying out” alternatives 
on a mental level without actually performing them on a physical level. 
Eventually, they will hit on the 
correct alternative and immediately recognize it as the right one. “Aha!” So even 
though we all do experience the flash of insight that the Gestalt psychologists 
were talking about, mental trial and error may be at the basis of this subjective 
experience. At present, there is no good way to resolve this controversy completely. 
It may well turn out that both trial and error and true insight operate in 
problem solving. 

FUNCTIONAL FIXEDNESS. Gestalt psychologists also introduced the interesting 
idea of functional fixedness. This refers to the tendency to think of an object 
only in terms of its customary uses. Consider the problem developed by Duncker (1945) (see also Weisberg, 
DiCamillo, & Phillips, 1978; Weisberg & Suls, 1973). A subject was given a 
candle, a book of matches, and a box of tacks and was told to “get the candle up 
on the wall so it will burn properly.” (Again, stop reading and try to solve the 
problem.) 

A solution to this problem that is frequently missed is to use the box containing 
the tacks as a “candleholder”: Take the tacks out of the box, tack the box to the 
wall, and put the candle in the box. People tend not to think of using the box as 
part of the solution to the problem because they think of it as a container and not 
a “shelf”; the nature of the box is “functionally fixed.” To solve the problem the 
subjects must overcome this tendency to think of the box as nothing more than a 
box; they must be able to think of using it in a new and unusual way. 

Interestingly, Duncker presented his candle problem to subjects with the tacks 
either inside or outside the box. When the tacks were outside the box, more 
subjects were able to solve the problem, as the perception of the box as a 
container for the tacks was presumably lessened. 

The Maier (1931) string problem mentioned above also contains the element of 
functional fixedness. To solve that problem the subject must overcome the 
tendency to think of the hammer or pair of pliers as tools and instead 
think about using them as pendulum weights. 

The initial use of boxes as containers inhibits their later use as a shelf. The 
initial use of pliers as tools inhibits their use as a pendulum. In both cases the first 
experience inhibits the second. We will learn more about transfer effects in 
problem solving, both positive and negative, in an upcoming section of this 
chapter. 

The experiments of the Gestalt psychologists are revealing demonstrations of 
problem solving, but they do not really tell us anything about how people solve 
problems. Great importance was placed on the role of insight, but the Gestalt 
experiments offer no clues about how or why someone suddenly interprets a 
string as the arm of a pendulum or a box as a candleholder. As in perception and 
memory, the vagueness of the Gestalt theories keeps them from being of significant 
use to us in truly understanding problem solving. 

Subgoals 

As the information-processing approach to psychology developed, however, 
theories better able to describe problem solving became available. Many of these 
modern theories and concepts draw on a step analysis of problem solving developed 
by Polya (1957). According to Polya, problem solving can be thought of 
as involving the following four steps: 

1. Understand the problem (determine the goal, the conditions of the problem, 
and the available data). 

2. Devise a plan to guide the solution of the problem. 

3. Carry out the plan. 

4. Check back: Review the solution, and correct the plan as necessary. 

Step 2 in this sequence is assumed by many to be the most critical phase in 
problem solving. Polya felt that some sort of creative thinking was necessary 
during Step 2, so that the overall problem would be broken down into a set of 
secondary goals, or subgoals, each more easily solved than the original. Thus, the 
essence of making a plan is the definition of subgoals. Similarly, these subgoals 
might themselves need to be broken down into subgoals, and so on. Once the 
problem has been reduced to a set of directly solvable tasks, they can be carried 
out, and the problem can be solved. 

For example, consider the problem of writing a book. It is tough to sit down 
and begin writing from page one. So we break the problem down into subgoals. 
We might set as a subgoal the development of a list of chapters. Then within each 
chapter we might address several issues. Writing about one issue within one 
chapter of the overall book is a subgoal that is not so difficult. Once that subgoal 
is reached, another can be tackled without having to panic over the somewhat 
overwhelming task of “writing a book.” By dealing with smaller, more easily 
handled subgoals, we can solve the overall problem. A difficult, complex problem 
can be managed if it can be broken down into a set of smaller, simpler problems 
(see Miller, Galanter, & Pribram, 1960). 
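
The idea that subgoals may themselves be broken into further subgoals can be pictured with a short Python sketch. It is only an illustration: the goal names and the SUBGOALS table are hypothetical, made up here to match the book-writing example, and the function name plan is ours rather than anything proposed by Polya.

```python
# Hypothetical goal tree for the book-writing example.
# Each goal maps to the subgoals it is broken into; a goal with
# no entry is treated as a directly solvable task.
SUBGOALS = {
    "write a book": ["list the chapters", "write chapter 1", "write chapter 2"],
    "write chapter 1": ["write about issue A", "write about issue B"],
    "write chapter 2": ["write about issue C"],
}


def plan(goal, subgoals=SUBGOALS):
    """Return, in order, the directly solvable tasks that make up a goal."""
    parts = subgoals.get(goal)
    if not parts:
        # Nothing left to break down: the goal can be tackled as it stands.
        return [goal]
    tasks = []
    for part in parts:
        # A subgoal may itself need to be broken into subgoals.
        tasks.extend(plan(part, subgoals))
    return tasks


if __name__ == "__main__":
    for task in plan("write a book"):
        print(task)
```

Calling plan("write a book") lists the small writing tasks in the order in which they would be tackled, so that no single step is as overwhelming as the original problem.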

Algorithms 

It is all well and good to say that the best strategy is to break complex problems 
down into simple ones. But deciding how to do that is not always easy. There may 
be several ways to divide up a problem, but only a few of these may actually 
increase the likelihood of reaching a solution. Sometimes the substeps we should 
take are quite clear and distinct, but sometimes they are not. 

Let us consider first the case in which the subgoals are clear and obvious. Take, 
for example, a simple arithmetic problem, such as multiplying two numbers. 
There is a very specific set of actions that can be taken in this problem (defined 
by the laws of mathematics), and these actions can be applied to the multiplication 
problem in a very direct way to obtain an answer. In fact, these actions are so 
specific and well defined that, if they are properly applied in the proper order, a 
correct answer to the problem will always be obtained. As such, these actions 
represent what psychologists call an algorithm for multiplication. An algorithm is 
a procedure that, if correctly applied, will always result in success. 
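
As a concrete illustration, the familiar paper-and-pencil method of multiplication can be written out as a short Python function. This is a sketch of one such algorithm, assuming non-negative whole numbers; the name long_multiply is ours, and other algorithms for multiplication exist.

```python
def long_multiply(a, b):
    """Multiply two non-negative integers by the schoolbook method."""
    # Work with the digits from right to left, as on paper.
    a_digits = [int(ch) for ch in reversed(str(a))]
    b_digits = [int(ch) for ch in reversed(str(b))]
    result = [0] * (len(a_digits) + len(b_digits))

    for i, da in enumerate(a_digits):
        carry = 0
        for j, db in enumerate(b_digits):
            total = result[i + j] + da * db + carry
            result[i + j] = total % 10   # digit written in this column
            carry = total // 10          # amount carried to the next column
        result[i + len(b_digits)] += carry

    # Put the digits back in reading order; int() drops leading zeros.
    return int("".join(str(d) for d in reversed(result)))


if __name__ == "__main__":
    print(long_multiply(127, 21))
```

Every step is fixed in advance, and no guessing is involved. If the steps are carried out correctly and in order, the answer is always correct, which is what makes the procedure an algorithm.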

Heuristics 

Unfortunately, we cannot always tell ahead of time how a problem should be 
broken down. We do not always have clear, simple rules that will always lead to a 
solution. We do not always have an algorithm for our problem. In these cases, we 
must turn to heuristic problem solving rather than algorithmic problem solving. 
In heuristic problem solving we try to make good guesses about what might be 
the best way to handle the problem. 

Consider the game of chess, for example: No simple set of actions exists for 
chess—the game is simply too complex. We cannot win a chess game in the same 
way that we solve a multiplication problem; we do not have a set of clear, simple 
actions to follow. But there are general “hints” that, if observed, will greatly 
increase our chance of winning. These are the kinds of suggestions that we can find 
in a chess book: “Keep your knights off the edges of the board.” “Protect your 
queen above all else.” And so on. These rules are the heuristics of chess. They are 
based on a sound understanding of what is needed to win, and, although they will 
not guarantee success, they greatly improve our chances. Observing these and 
other heuristics will sometimes prevent us from finding a winning solution (for 
instance, in a game in which we have to sacrifice our queen in order to win), but, 
more often than not, they will help us find a solution to the problem. 

Thus, heuristic problem solving, which is probably very common in our everyday 
life, involves making educated guesses about what would be the best way to 
solve a problem. For example, suppose that you wish to become the friend of an 
attractive person. You do not quite know how to do it. That is, you do not have a 
clear algorithm for the problem. So you make some educated guesses. You know 
that the person is shy, so you do not just jump all over him. You adopt a cautious, 
casual approach. You know that he likes the out-of-doors, so you try talking 
about nature. You do your best to break the problem down into simpler subproblems, 
based on your best guess about what will work. You search through all of 
the possible things that you might do and try to find a sequence of actions that 
will lead to success (see also Nisbett, Krantz, Jepson, & Kunda, 1983). 

Strategies: Working Backward 

Thus far we have noted that problem solution can be facilitated by defining 
subgoals and by making educated guesses about what to do to solve the problem. 
In addition, psychologists have noted that there are overall strategies that can be 
adopted in our attempts to apply heuristics and generate subgoals. 

For example, Newell and Simon (1972) discuss the strategy of working backward. 
In a very complex problem it is easy to work on the problem without 
knowing whether you are really making any progress. Something is being done, 
but it may not be taking you any closer to the solution. This is especially true 
when the problem has only one solution (as opposed to games such as chess, 
which have many winning positions). In such problems working backward can be 
a very useful strategy. The idea is to reverse the problem so that you start with the 
goal and try to reach the given information. Solving a problem in this way still 
requires effort, but because your work is always in direct contact with the goal, 
many “blind alleys” can be avoided. 

Suppose that you want to trace a route from New York to San Francisco on a 
road map. You could start in New York and work your way west. But because 
you will not be able to predict what is ahead, you may end up on a detour to Des 
Moines. If you start in San Francisco and work backward toward New York, 
however, you may be able to avoid some of these “dead ends.” 
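
The road-map example can be sketched in Python. The map below is hypothetical and deliberately simplified: the road links are invented for illustration, are treated as one-way, and make no claim to geographic accuracy. The same breadth-first search is run once forward from New York and once backward from San Francisco, and the number of cities each search has to examine is counted.

```python
from collections import deque

# Hypothetical one-way road links, invented for illustration only.
ROADS = {
    "New York": ["Cleveland", "Washington"],
    "Washington": ["Atlanta"],
    "Atlanta": ["Miami"],
    "Cleveland": ["Chicago", "Detroit"],
    "Chicago": ["Des Moines", "Omaha"],   # Des Moines is a dead end here
    "Omaha": ["Salt Lake City"],
    "Salt Lake City": ["San Francisco"],
}


def reverse_links(links):
    """Turn 'city -> cities it leads to' into 'city -> cities that lead to it'."""
    back = {}
    for city, neighbours in links.items():
        for neighbour in neighbours:
            back.setdefault(neighbour, []).append(city)
    return back


def search(start, goal, links):
    """Breadth-first search. Returns (route, number of cities examined)."""
    queue = deque([[start]])
    seen = {start}
    examined = 0
    while queue:
        route = queue.popleft()
        examined += 1
        if route[-1] == goal:
            return route, examined
        for nxt in links.get(route[-1], []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(route + [nxt])
    return None, examined


forward_route, forward_cost = search("New York", "San Francisco", ROADS)
backward_route, backward_cost = search(
    "San Francisco", "New York", reverse_links(ROADS)
)
backward_route.reverse()   # present the route in travelling order

if __name__ == "__main__":
    print(forward_route, forward_cost)
    print(backward_route, backward_cost)
```

On this made-up map both searches arrive at the same route, but the backward search examines fewer cities, because every city it visits lies on a road that really does lead to San Francisco. The advantage depends on the shape of the map and is not guaranteed for every problem.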

There are many other strategies that can be adopted in problem solving. As 
Sweller (1983) points out, exactly which strategy you follow will depend on the 
nature of the problem and your understanding of that problem. With one type of 
problem, for example, you might decide that your best strategy is to take your 
past moves into account each time you consider a new move, whereas in another 
situation you might deem past moves irrelevant. 

Protocol Analysis 

Thus far it has been suggested that, when solving problems, people do the 
following: 

1. They break large problems down into simpler, smaller problems. 

2. They make guesses about what would be best to do to solve the problem. 

3. They follow overall strategies in their efforts to find a solution. 

You might reasonably ask what kinds of data lead psychologists to conclude that 
these three principles of problem solving are valid. Much of the modern work on 
problem solving involves protocol analysis, or the study of the verbal reports that 
subjects give while they are solving a problem. In essence, the subjects “think out 
loud” while solving the problem, and the psychologist then examines these 
reports. 

Needless to say, many investigators through the years have objected to the use 
of this sort of self-reporting. They have argued that self-reports are generally 
unreliable and that subjects do not really know or cannot really tell you what they 
are doing. Nonetheless, it appears that, even though these data are unacceptable 
to some, the careful analysis of subjects’ protocols, combined with more objective 
measures when possible, has significantly advanced our understanding of problem 
solving. Let’s look at the kind of problem-solving situation in which subjects’ 
verbal reports have been recorded. 

One type of problem that has been studied extensively by protocol analysis is 
the “cryptarithmetic” problem. A set of letters is arranged in the form of an 
arithmetic problem, such as: 

  DONALD 
+ GERALD 
-------- 
  ROBERT 

Each letter stands for a different digit from 0 to 9, and the value of one of the 
letters is given (here, D = 5). The subject’s goal is to figure out the values of the 
other letters. (Again, you might stop and try solving this problem. This is a tough 
one.) 

Logically speaking, the information that D is equal to 5 is all that is needed to 
solve the problem. As a start, it is clear that, because D = 5, the T in the 
right-hand column must be 0. This is because D (5) plus D (5) equals 10, so a 0 is written and a 1 is carried. 
The values of the other letters can be deduced in similar ways. The general 
procedure is to ask the subjects to talk out loud as they work. (The solution is 
T = 0, G = 1, O = 2, B = 3, A = 4, D = 5, N = 6, R = 7, L = 8, E = 9.) Then, 
by looking at what subjects have said, the investigator can determine if subgoals 
were established, if heuristics were employed, and if strategies were followed. 

Both Bartlett (1958) and Newell and Simon (1972) have studied this particular 
problem. What they found was that the steps reported by subjects generally 
conformed to the three principles of problem solving that we have been discussing. 
Subjects do break down the overall problem into smaller problems, they do 
make “best guesses” about what to do, and they can and do follow strategies 
(such as working backward). 
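
For readers who would like to check the DONALD + GERALD = ROBERT solution given above, the following Python sketch tries every assignment of the remaining digits with D fixed at 5. It is a brute-force check and not a model of how human subjects reason; the function names word_value and solve are ours.

```python
from itertools import permutations


def word_value(word, digit_of):
    """Read a word as a base-10 number using the letter-to-digit mapping."""
    value = 0
    for letter in word:
        value = value * 10 + digit_of[letter]
    return value


def solve(d_value=5):
    """Return every digit assignment that makes DONALD + GERALD = ROBERT."""
    others = "ONALGERBT"   # the nine letters besides D
    free_digits = [d for d in range(10) if d != d_value]
    solutions = []
    for perm in permutations(free_digits):
        digit_of = dict(zip(others, perm))
        digit_of["D"] = d_value
        total = word_value("DONALD", digit_of) + word_value("GERALD", digit_of)
        if total == word_value("ROBERT", digit_of):
            solutions.append(digit_of)
    return solutions


if __name__ == "__main__":
    for solution in solve():
        print(sorted(solution.items(), key=lambda item: item[1]))
```

The program examines several hundred thousand assignments blindly. Human subjects, by contrast, report reasoning column by column from what they already know, which is one reason their protocols are informative.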


Simulation Models 

A good deal of effort has been put into the construction of computer programs 
that incorporate our three principles of problem solving (subgoals, heuristics, and 
strategies). These programs have been given names like the Logic Theorist 
(Newell, Shaw, & Simon, 1958), and the General Problem Solver, or GPS (Ernst 
& Newell, 1969; Newell & Simon, 1972). The general idea behind the construction 
of these programs is that, if they represent good or useful models of human 
problem solving, they must solve problems in much the same way as people do. 
Hence, a great deal of attention has been given to a comparison of how humans 
and computer programs solve the same problem. 

The results have been mixed. Whereas these programs certainly seem capable 
of solving a wide range of problems, they often do not do it in the same way that a 
real, live, warm-blooded person does. For example, GPS makes moves without 
consideration for past moves. This simply does not seem to be true of humans, 
who do consider past moves when choosing their next move. Hence, many of the 
computer programs are of limited value as psychological theories, even though 
they do incorporate heuristics, subgoals, and strategies. The reader interested in 
pursuing computer simulation of problem solving is referred to Atwood and 
Polson (1976), Atwood, Masson, and Polson (1980), Greeno (1974), Hayes and 
Simon (1974), Jeffries, Polson, Razran, and Atwood (1977), Simon and Hayes 
(1976), Simon and Reed (1976), and Thomas (1974). 

Transfer in Problem Solving 

What transfer effects result from solving a series of related problems? We have 
already seen that both positive and negative transfer effects can sometimes occur. 
Sometimes solving one or more problems will help you to solve subsequent 
problems, and sometimes it will hinder your solution of those subsequent 
problems. 

As an example of positive transfer, we can note that monkeys required to solve 
successive discrimination problems become better and better at it until, after 
several hundred similar problems, they are able to solve additional problems in 
one trial. And you certainly remember that practice on successive math problems 
helped you to quickly solve similar problems. 

The functional-fixedness effect described earlier represents an instance of negative 
transfer in problem solving. Perhaps an even clearer example is provided by 
Luchins’s (1942) “water jug” problem. Subjects were asked, “Given three jugs 
that contain 21, 127, and 3 quarts, how would you measure 100 quarts?” Nearly 
all of Luchins’s subjects reported filling the largest jar and then removing 27 
quarts with the help of the other two jars. Subjects then went on to solve a second 
problem: “How would you measure 20 quarts with 23-, 49-, and 3-quart jars?” 
Surprisingly, 81% of Luchins’s subjects solved this problem by a method identical 
to the first problem, one that involved all three jars; control subjects uniformly 
solved it the simple way, by filling the 23-quart jar and taking out 3 quarts (see 
Table 12.4 for additional examples). Thus, previous experience with a particular 
kind of problem can both help and hurt our subsequent efforts to solve additional 
problems (see also Table 12.5). 

Table 12.4 Jar Problems 

                 JAR CAPACITIES          AMOUNT OF WATER 
PROBLEM      JAR A    JAR B    JAR C     TO BE OBTAINED 
   1           21      127        3           100 
   2           14      163       25            99 
   3           18       43       10             5 
   4            9       42        6            21 
   5           20       59        4            31 
   6           23       49        3            20 
   7           10       27        7             3 

In each case, use a water tap and the jars to obtain the amount of water in the right-hand column. 
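
The arithmetic behind the jar problems of Table 12.4 can be checked with a short Python sketch. It assumes that the practiced method is "fill jar B, then pour off jar A once and jar C twice" (that is, B - A - 2C), which is our reading of the description of the first problem; the function names habitual and shortcuts are ours.

```python
# (jar A, jar B, jar C, amount wanted), copied from Table 12.4.
PROBLEMS = [
    (21, 127, 3, 100),
    (14, 163, 25, 99),
    (18, 43, 10, 5),
    (9, 42, 6, 21),
    (20, 59, 4, 31),
    (23, 49, 3, 20),
    (10, 27, 7, 3),
]


def habitual(a, b, c):
    """Amount left after filling B and pouring off A once and C twice."""
    return b - a - 2 * c


def shortcuts(a, b, c, target):
    """Simpler ways to reach the target using jars A and C alone."""
    found = []
    if a - c == target:
        found.append("A - C")
    if a + c == target:
        found.append("A + C")
    return found


if __name__ == "__main__":
    for number, (a, b, c, target) in enumerate(PROBLEMS, start=1):
        works = habitual(a, b, c) == target
        print(number, "habitual method works:", works,
              "| shortcuts:", shortcuts(a, b, c, target))
```

Under this reading the practiced method solves every problem in the table, but for the last two problems a much simpler two-jar solution is also available. Those are the problems on which a subject who keeps using the practiced method shows negative transfer.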

But even though positive and negative transfer effects in problem solving have 
been identified, the situation is far from clear. Positive transfer effects are often 
weak or nonexistent. One theme that does seem to appear repeatedly is the idea 
that the occurrence of positive transfer seems to depend on how the initial 
problem was solved. For example, Mayer (1974) found that subjects who were 
initially taught to solve problems by algorithm (rule) did best when transferred to 
nearly identical problems. Subjects taught to solve problems by a heuristic 
method (discovery), in turn, did better on subsequent problems that required 
significant understanding of the problem’s concepts (see also Egan & Greeno, 
1974; Jeffries, 1979; Perfetto, Bransford, & Franks, 1983; Reed, Ernst, & 
Banerji, 1974). 


Knowledge and Problem Solving 

Closely related to the question of transfer in problem solving is the issue of how 
important knowledge about the general domain of a problem is in the solution of 
a problem. Specifically, if you know more about a problem area, are you more 
likely to be able to solve problems in that area? Can biologists, with training and 
knowledge in the field of biology, solve problems related to biology faster and 
more easily than people without knowledge of biology? Can lawyers solve legal 
problems faster than nonlawyers? 

Table 12.5 Another Example of Negative Transfer 

Read the words aloud from top to bottom. 

MACARTHUR 
MACINTOSH 
MACCARTHY 
MACBETH 
MACDOWELL 
MACKENZIE 
MACDONALD 
MACHINES 
MACDILLON 

Now look at the next-to-last word. 
You may have mispronounced it. 

The intuitive answer is yes, and the data support that hunch. Generally speaking, 
knowledge about a given area can facilitate problem solution within that area. 
Prior knowledge “transfers” to the current problem. 

Let us look at a couple of examples of the way in which knowledge aids 
problem solving. DeGroot (1946, 1965) assembled a group of chess experts and a 
group of novices. He compared these two groups in terms of general memory 
ability and found that they were equivalent; the chess experts did not have better 
memory abilities than novices. Next, both groups were shown two arrangements 
of chess pieces on a board, forming either a meaningful board arrangement (taken 
from the middle of an actual chess game) or simply a random arrangement of the 
pieces on the board. Each board was exposed for a few seconds, and the subjects 
were then asked to reconstruct the placement of the pieces to the best of their 
ability. DeGroot found that the experts were better than the novices at reproducing 
the meaningful board positions but were no better than the novices when it 
came to recalling the random positions. This suggests that the experts somehow 
were better able to use their past, intensive experience with the game to recognize 
and encode board positions into a set of meaningful chunks, such as “the castled 
black king is forked by a knight.” They did not have to remember the locations of 
many isolated individual pieces. This ability to chunk, or recognize and remember, 
patterns of two or more pieces, rather than isolated locations, naturally led to 
superior recall by the experts on the meaningful positions. 

Thus, it can be argued that experts’ ability to win games (solve problems) is 
facilitated by knowledge about common (and not so common) patterns of pieces 
that occur in the game. Their task in planning future moves is greatly simplified. 
They can think about alternative changes of simple patterns, or chunks, each 
involving two or more pieces, rather than having to remember and consider 
alternative placements of each and every piece (see also Chase & Simon, 1973). 

The fact that knowledge about a field can aid problem solving has been 
demonstrated in other areas, too. For example, Reitman (1976) has found similar 
effects when comparing Go masters with novices at the game of Go. The importance 
of knowledge in solving computer-programming problems has been described 
by Reitman, McKeithen, Rueter, and Hirtle (1980). Finally, Bhaskar and 
Simon (1977) report findings suggesting that general knowledge about chemical 
engineering can facilitate the solution of chemical-engineering problems. 

But before we all go running off convinced that prior knowledge always helps 
problem solving, we should note that there are exceptions. For example, Weisberg, 
DiCamillo, and Phillips (1978) had subjects first learn a paired associate 
composed of the words candle and box embedded in a list of similar paired 
associates. The subjects then tried to solve the candle-on-the-wall problem described 
earlier (Duncker, 1945). That was the problem in which subjects were 
given a candle, matches, and a box of tacks and asked to mount the lighted candle 
on the wall. 

If prior information always helped problem solution, you would expect these 
subjects to do well on the problem. After all, they had almost been given the 
answer in the form of the candle-box paired associate. But these authors found a 
facilitative effect only under certain circumstances. Specifically, if before trying to 
solve the problem the subjects were told that one of the just-learned paired 
associates could help them solve the problem, then it did. But if the subjects were 
told nothing about the relevance of the paired associates, their performance was 
not improved. Thus, simple knowledge about a problem, without a realization of 
the relevance of that knowledge, is not enough to aid problem solution. 

Planning and Problem Solving 

Planning a sequence of steps to be taken appears to be important in problem 
solving. Although planning may not be all that crucial in some simple, artificial 
laboratory problems, real-life problems do seem to benefit from some planning. 
More complex problems often need more planning. For example, Hayes-Roth 
and Hayes-Roth (1979) studied how people go about a series of errands in a given 
day. In solving this sort of problem subjects go to great extremes in planning 
where to go, when, and in what order. We all are familiar with this experience, so 
there is no need to go into detail. The point is that a little planning saves 
backtracking and wasted effort. Greeno, Magone, and Chaiklin (1979) have 
demonstrated the importance of planning in the solution of geometry problems 
(see also Hanley & Levine, 1983). 

Ill-Defined Problems 

Ill-defined problems are problems for which we have no clear way of judging 
whether a solution is correct. When we are faced with building a house, selecting 
a course, or composing a musical composition, we are certainly faced with 
important problems. But even when we have “solved” them, we are not sure that 
our solution is the best one. People can disagree about any solution we arrive at. 
Some people will feel that our new house is “just perfect,” and others might 
conclude that they could never live with those “funny windows.” 

Ill-defined problems are troublesome because (1) they may actually be more 
common in our everyday life than well-defined problems and (2) they lead to 
continued uncertainty: “Should I have gone to medical school?” “Should we have 
painted the walls blue?” With well-defined problems (for example, How much is 
2 + 2?) there is no disagreement and uncertainty about the solution. But with 
ill-defined problems there always is. 

Still, life must go on, and we must solve ill-defined problems. How do we do it? 
Matlin (1983) has outlined several strategies we use, as follows: 

1. We break the problem down into several subproblems. We might say, “Look, 
let’s just worry about the design of the kitchen today and leave the rest 
for later.” 

2. We can add structure to the situation that limits possibilities. “So we agree it 
will be a modern design with lots of redwood. Now, what can we do within 
those constraints?” 

3. We can start work on the problem before we fully understand it. “Let’s just 
go ahead and put in the bathroom there. Later we’ll see how everything else 
will fit.” 

4. We can stop before the “perfect” solution is reached. “It’s good enough. I 
like it. Let’s quit.” 

Figure 12.9 Materials used in demonstrating that chimps can solve problems using 
analogical reasoning. (Adapted from Gillan, D. J., Premack, D., and Woodruff, G. 
Reasoning in the chimpanzee I. Analogical reasoning. Journal of Experimental 
Psychology, 1981, 7, 1-17. Copyright 1981 by the American Psychological Association. 
Reprinted by permission.) 


ANIMALS AND PROBLEMS 

As we have seen, animals are capable of solving many problems. And in Chapter 
13 we will address the fascinating subject of language use by apes. But to close 
this chapter we will describe an example of animal problem solving that underlines 
the fact that animals are capable of very complex cognitive activity. Gillan, 
Premack, and Woodruff (1981) wanted to find out if chimps could solve problems 
involving analogical reasoning. The materials they used are shown in Figure 12.9. 
The general form of the problem was, “A is to A' as B is to which of two 
alternatives, B' or C?” A lock (A) is to a key (A') as a closed paint can (B) is to 
which of the two alternatives, a can opener (B') or a paint brush (C)? The chimps 
were able to handle this problem; they chose B', indicating that they understood 
that a lock is to a key as a closed can is to a can opener. This demonstrates 
considerable reasoning power on the part of the animal. 

SUMMARY 

1. A concept is a symbol that stands for a class of objects or events that possess 
common attributes. 

2. Concepts can be verbal or nonverbal, can be simple or complex, and are 
related to one another in complicated ways. They help us think. 

3. Concept formation can be thought of as a special kind of problem solving. 

4. In a typical experiment subjects are asked to judge whether successive stimuli, 
which vary along a number of dimensions, are members of the concept the 
experimenter has in mind. 

5. Concept learning may involve attribute discovery, rule discovery, or both. 

6. Concept-learning difficulty increases as rule complexity increases. 

7. The old S-R interpretation of concept learning argues that concept formation 
is similar to stimulus discrimination. This interpretation avoids having to 
think about what goes on “in the mind.” 

8. But the hypothesis-testing approach has pretty well replaced the older S-R 
conception. According to this view, subjects test successive guesses, or 
hypotheses, about what the correct hypothesis is. Hypotheses are tested and 
rejected until the correct one is discovered. 

9. In conservative-focusing experiments the subject can demand to have any 
stimulus classified, rather than having to wait until that stimulus appears in a 
random order, as is the case in most concept experiments. In conservative 
focusing the subject focuses on one dimension at a time and investigates it 
thoroughly before moving on to another dimension. 

10. Conservative focusing is an excellent strategy, but only when the subject is 
free to call for certain stimuli and only when the concept involves a single 
dimension. 

11. Levine’s work with response patterns during blank trials allows us to determine 
what hypothesis, if any, people are using and when, and to what 
hypothesis they change. 

12. Natural categories, as distinct from experimenter-defined laboratory categories, 
are hierarchical. 

13. Superordinate, basic-level, and subordinate categories have been defined. 

14. Basic-level categories have common attributes, motor movements, and 
shapes. Members of these categories are also used to identify objects. 

15. Instances of a category may be more or less representative of that category. 

16. Complex concept formation has been thought of as prototype formation. 

17. Feature-frequency models of concept formation have also been developed. 

18. In prototype formation an ideal pattern, or image, of the concept (such as 
triangle) is developed and used as a standard against which to judge any 
given stimulus that is to be categorized. 

19. However, it also appears that, even when a prototype is used, rules are 
developed that indicate just how much a stimulus can differ from the prototype 
and still be called a member of the category. Thus, both rules and 
prototypes are probably used simultaneously. 

20. Animals form and use concepts. 

21. Problems exist when a motivated organism is blocked from reaching a goal. 

22. Problem solving is one form of cognitive activity. 

23. Gestalt psychologists emphasize the flash of insight in problem solving. They 
note that problem solving is often sudden and permanent. 

24. But the insight experience may be nothing more than the result of mental 
trial and error. 

25. Functional fixedness refers to a failure to solve a problem because one or 
more of the available elements is perceived as being usable only in a limited, 
prescribed, unhelpful manner. 

26. People can solve problems by breaking them down into simpler problems, or 
subgoals, each of which is easier to solve than the overall problem. 

27. Algorithmic problem solving is said to occur when a procedure that will 
always lead to a correct solution is followed. 

28. Heuristic problem solving occurs when we make reasoned guesses about 
what might be the best thing to do. This process helps problem solving but 
does not guarantee a correct solution. 

29. Organisms follow overall strategies while solving problems, such as working 
backward. 

30. In protocol analysis the subjects’ self-reports of what they are doing during 
problem solving are examined. 

31. Many simulation models of problem solving, such as GPS, have been developed. 
These programs are interesting but of limited value in terms of being 
accurate psychological theories. 

32. Positive and negative transfer effects can sometimes, but not always, be 
demonstrated in problem solving. 

33. Knowledge about the general domain of a problem can facilitate problem 
solution. 

34. Planning helps problem solving, particularly with complex problems. 

35. Ill-defined problems are those for which we have no clear way of judging 
whether a solution is correct. 

36. There are several strategies for solving ill-defined problems. 

37. Animals can solve complex problems. 

THE IMPORTANCE OF LANGUAGE

Did you ever stop to consider the fact that, by spending a few days in your local 
library, you can know more about physics than Galileo ever knew, more about 
psychology than Freud ever dreamed of, and more about natural selection than 
Darwin was aware of? Language contributes to this ability. It ensures, through 
written and spoken records, that knowledge is cumulative and that we will not 
have to rediscover everything that our ancestors discovered. Language enables us 
to stockpile information. It allows us to build on previous information. It certainly
allows us to store information in excess of the capacity of our immediate
memory. 

In addition, language allows us to communicate with one another in highly 
sophisticated ways. It is true that animals of many species communicate with one 
another. They do have their own interesting languages. Bees can communicate the 
location of a food source by varying their bodily movements (von Frisch, 1974). 
And, as we shall see in the last section of this chapter, apes appear capable of 
using the rudiments of language as we know it. But in general, animal communication
is limited when compared with that of the human being. We are the
true masters of language, and our use of language seems to set us apart from the 
rest of the animal kingdom. 

The use of language is such an everyday activity for us that we take it for 
granted. Even the simplest studies of language show, however, that language 
comprehension and use is incredibly complex. Language is so intertwined with 
our thought processes that it is difficult to imagine what thinking would be like 
without it. 

Hence, if we are to understand the so-called higher, or cognitive, mental 
processes, it almost goes without saying that we must comprehend language 
processes. Even though there probably are cognitive events that do not involve 
language, so much of our thinking does involve language that the two can hardly 
be understood in isolation from each other. Table 13.1 lists some of the ways in 
which we use language. You can probably think of some additional examples. 

In summary, language helps us in at least three important ways: 

1. It helps us communicate effectively. 

2. It allows us to store enormous amounts of information. 

3. It aids our thought processes by providing a system of symbolic representation.

The study of language use and language learning is a wide-ranging field, 
touching on many of the issues that we have already discussed in this text. In this 
chapter we shall first consider language development. Then, in increasing order of 
complexity, we shall explore the nature and storage of words, the construction, 
comprehension, and retention of sentences, and memory for prose. Finally, in 
keeping with our concern about the relationship between animal and human 
learning, we shall discuss language in apes. 

Table 13.1 Some Common Ways We Use Language

Solving problems
Describing events
Describing objects
Telling others what to do
Acting
Teaching
Guessing or answering
Telling stories
Asking questions
Expressing emotions
Greeting and thanking people
Singing
Reading
Speculating
Predicting and reporting
Daydreaming
Translating
Learning and storing information
Creating

LANGUAGE DEVELOPMENT

In this section on language development the steps that humans follow in their
initial acquisition of language are described briefly. The intent is not to theorize
about mechanisms underlying language (that is saved for later sections) but,
rather, to outline some of the interesting regularities that appear in our early
life.

Aside from crying, coughing, sneezing, and hiccuping, the sounds that newborn 
babies make are quite limited. But not for long. It is not too many months before 
most babies begin to put out an impressive array of verbal sounds, usually called 
babbling. The interesting thing about all of this babbling, which often seems to be 
done with great enthusiasm, is that it contains many of the basic sounds of 
full-blown adult speech. It seems as if the very basic verbal units, or sounds, are 
almost innate. At the very least they occur spontaneously and do not need to be 
“learned” in the sense that they must be practiced before they are perfected. It 
almost seems that the acquisition of language refers to the ordering and organizing
of units of sound that are given, or innate.

There is evidence that all human babies, regardless of where they live or under 
what conditions, engage in comparable amounts and types of babbling (see 
Atkinson, MacWhinney, & Stoel, 1970; Lenneberg, 1967). The environment does 
not seem to have much of an initial impact on this babbling. As we grow and 
interact with the environment, however, we begin to restrict our vocalizations to 
those that compose the elements of the language spoken in our environment. 
Chinese children begin to concentrate on the sounds that make up the Chinese 
language. French children begin to make “French” sounds. By the time we are 
adults, we find it difficult to produce the sounds necessary for another language, 
even though we could do it as infants. 

At about 1 year of age (the time varies from child to child), children begin to 
acquire a few words. They are on their way to building a vocabulary. Although it 
starts slowly, the acquisition of a vocabulary soon accelerates. By the time they 
are 5 years old, children may have a vocabulary of several thousand words. 
Growth of the vocabulary can continue throughout life. For example, you will 
now add to your vocabulary by learning that holophrastic speech (McNeil, 1970)
refers to the fact that young children often use a single word to express an entire
idea. For instance, a child may say, “No,” but mean, “No, I don’t want to be put
down. Hold me and tell me wonderful things.” Children apparently understand
much more than they are able to verbalize. 

The first sentences children produce, at somewhere around the age of 18 
months, tend to be composed of two words. The child will say, “More juice,” or 
“No bed,” or “See baby.” Again, starting slowly, the child soon accelerates in the 
acquisition of two-word sentences, using several thousand different ones by the 
age of 2 to 2½.

Telegraphic speech refers to the tendency of children at this age to maintain the 
correct order of words but to leave out many of the less important ones. For 
example, a child might say, “Doggie baby,” meaning “The doggie licked the 
baby.” The understanding is there, but the technical language ability is not. 

Between 2 and 3, children begin to produce longer sentences. More complex 
grammatical forms appear. Children vary widely in how quickly they acquire 
language. Some are amazingly quick about it, whereas others lag. Still, it is fair to 
say that almost all children show quite rapid development in this area. We are 
“born to talk,” and we waste little time getting to it. 

Interestingly, even though some children acquire grammar faster than others, 
the order in which that grammar is learned seems to be constant from child to 
child. They learn the rules of grammar in the same sequence even though the rate 
of acquisition may differ. For example, adding the suffix -ing appears early, 
whereas adding ’s to indicate possession usually occurs later. Finally, children 
seem to try out, or test, various guesses, or hypotheses, about what is correct 
grammar. Through feedback from the adult world they appear to alter and 
correct their hypotheses. They do not just jump right in with a complete copy of 
adult speech. 


THEORIES OF LANGUAGE ACQUISITION 
Language acquisition has been interpreted in a number of ways. For example, the 
acquisition of word meaning has been thought of in terms of classical conditioning.
The spoken word doggie is repeatedly paired with a real dog and its actions.
The spoken word doggie is a CS, whereas the actual dog is the UCS. Whatever 
reaction the child has to the real dog will become classically conditioned to the 
word doggie and will constitute its meaning. 

Similarly, language acquisition has been thought of in terms of instrumental 
conditioning. Each time a young child says a word or uses a little sentence, he or 
she is praised and rewarded by adults. Just as a rat learns to press a bar for food 
reinforcement, the child learns to speak and use language in return for social 
rewards (see Bloom, 1975; Bloom & Esposito, 1975). 

Shaping is probably important in the instrumental conditioning of language.
For example, if a 2-year-old says, “See my ’flection?” she will be rewarded,
because to say ’flection at that age represents a pretty close approximation of
reflection. But as the child grows older, adults will demand closer and closer 
approximations to the word reflection before a reward will be delivered. 

These conditioning interpretations of language acquisition have been severely
criticized lately. Specifically, it has been argued that language acquisition is too
complex to be interpreted in terms of a simple S-R framework. It has been
pointed out that language use has a generative, creative aspect, which is difficult
to reconcile with simple S-R notions. Children use language in novel
ways, as though they are testing hypotheses. They generate new language patterns
that they have never heard before.

All of these criticisms, coupled with other developments, have led to what is 
generally known as the psycholinguistic approach to language acquisition. 
According to this approach, humans are seen as genetically prepared to acquire 
and use language. We are ready and willing in a biological sense. In addition, the 
psycholinguistic approach argues that we learn to use rules and test hypotheses
when we acquire language, rather than learning specific S-R connections.
Although the psycholinguistic approach does not necessarily disprove the S-R
approach, it has become the favored way of thinking about language acquisition.
In the following sections we shall be using the language of the psycholinguistic
approach. At the same time you should keep in mind that the older S-R conditioning
approach, although out of vogue, has not been completely discredited.
For example, to note that we learn rules when we learn language is not to deny 
the validity of the conditioning approach. It can always be said that use of the rule 
is reinforced just as more discrete, identifiable responses are reinforced. 

WORDS: STRUCTURE AND STORAGE 
Having briefly looked at the development of language in the human infant, let us 
turn our attention to what psychologists have learned about the nature of language.
The rest of this chapter will be devoted to increasingly complex levels of
language. Specifically, a consideration of the structure and storage of words 
will be followed by a discussion of how words are combined into meaningful 
sentences. Finally, the grouping of these sentences into meaningful prose will be 
considered. 


Phonemes and Morphemes 

All spoken languages are composed of between 15 and 85 basic sounds. English 
consists of something like 45 sounds rearranged in many ways. These basic sounds, 
or building blocks, are called phonemes by linguists and psychologists. Many of 
the English phonemes correspond to the sounds of the letters of the alphabet, 
although some of them involve combinations of letter sounds. Thus the t sound at 
the beginning of tack is a phoneme, and so is the sh sound at the beginning of 
should. 

Phonemes alone generally have no meaning. They must be combined with one 
another to form meaningful units. These combinations of phonemes are called 
morphemes. Morphemes are the smallest meaningful units in a language. A 
morpheme is most often, but not always, composed of two or more phonemes. 
For example, the words a and I are morphemes composed of single phonemes.

Many, but not all, morphemes are words. For example, run and tap are morphemes
that are also words. But prefixes and suffixes such as pre- and -ness are
also morphemes even though they are not words. They are morphemes because 
they have meaning, as in prejudged and happiness, even though they are not 
words by themselves. Many words are composed of single morphemes, such as 
run and tap. But many other words are composed of more than one morpheme. 
Thus, helpful is a word composed of two meaningful morphemes (help and ful).

Languages have rules about the ways in which phonemes and morphemes can 
be combined into “legal,” or acceptable, words. For example, certain phoneme 
combinations are not acceptable in English. How many English words can you
think of that begin with hp? None, because this combination is not acceptable
under the rules of English. For that matter, an initial h is never followed by b, c, d,
f, g, h, j, k, l, m, n, p, q, r, s, t, v, w, x, or z, unless somebody has slipped in a new
one. Thus, of the 26 letters of the alphabet only 6 are allowed to occur after h at
the beginning of a word. These rules, shared by a population, make communication
possible. If a group of people agrees on a set of rules for the construction of
legal words and abides by these understood rules, then communication occurs
without the problems that would occur if everyone had different rules about word
construction.
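
The rule about what may follow an initial h can be written out as a small check. The following is only a sketch under stated assumptions: it works on spelling rather than on phonemes, it encodes only the single rule described above, and the names FORBIDDEN_AFTER_INITIAL_H and is_legal_h_onset are invented for this illustration.

```python
import string

# Letters that, according to the rule in the text, never follow an initial h.
FORBIDDEN_AFTER_INITIAL_H = set("bcdfghjklmnpqrstvwxz")

# Whatever is left of the 26 letters is allowed in second position.
ALLOWED_AFTER_INITIAL_H = set(string.ascii_lowercase) - FORBIDDEN_AFTER_INITIAL_H


def is_legal_h_onset(word):
    """Return False only when the word starts with h plus a forbidden letter.

    Words that do not begin with h are not covered by this one rule,
    so they are passed through as legal.
    """
    letters = word.lower()
    if len(letters) < 2 or letters[0] != "h":
        return True
    return letters[1] in ALLOWED_AFTER_INITIAL_H


print(sorted(ALLOWED_AFTER_INITIAL_H))  # the 6 letters allowed after initial h
print(is_legal_h_onset("happy"), is_legal_h_onset("hpark"))
```

A fuller account would need one such constraint for every position and every phoneme, which is why the complete rule system of a language is far larger than this single check suggests.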


Lexical Memory: Semantic Networks 
Before turning to the study of the arrangement of words into sentences and 
sentences into prose, brief mention should be made of word storage, or lexical 
memory and meaning. Psychologists have long been interested in the meanings of 
words and in the storage of those meanings. 

Perhaps the most noticeable property of words is that they are associated with 
other words. The notion of associations among words is not new, going back at 
least as far as Aristotle, and it was an obvious place for psychology to begin its 
study of word meaning. According to this early approach, the meaning of a word
is, in some sense, defined by its association with other words. Some have argued
that the meaning of a word is the constellation of words that are associated with
it. We have already had a chance to look at some word-association experiments in
Chapter 11 (see also Deese, 1962; Garskof & Houston, 1963).

While fruitful, the word-association approach was short lived. However, several
different approaches to the understanding of lexical storage and word meaning
grew out of the realization that the word-association approach was an
oversimplification of a very complex situation.

In Chapter 9 we outlined some of the more modern conceptions of word 
storage, most of which can be subsumed under the heading of semantic-network 
models. Hence, all we need do here is refresh our memory with respect to these 
newer models and point out that they serve as, among other things, conceptions 
of lexical memory. If you will recall, these models included hierarchies (Collins & 
Quillian, 1972), matrices (Broadbent, Cooper, & Broadbent, 1978), feature 
models (Smith, Shoben, & Rips, 1974), and spreading-activation conceptions 
(Collins & Loftus, 1975). 

These newer approaches to the understanding of word storage substantially
advanced our knowledge of language processes and provided better ways
of thinking about how the information we have in our head is organized (see 
also Anderson, 1976, 1983; Hayes-Roth & Hayes-Roth, 1975; Loftus, 1973; 
McClosky & Glucksberg, 1979). This work also foreshadowed much of the 
recent attention given to sentences and prose, to which we now turn. 

SENTENCES 

Having looked at lexical memory, or memory for words, in Chapters 9 and 11 
and having reviewed this topic in the preceding section, we are ready to move on 
to the next major division in the study of language—the study of sentences. 
Rewrite rules

1. Sentence → Noun phrase + Verb phrase
2. Noun phrase → Determiner + Noun
3. Verb phrase → Verb + Noun phrase
4. Determiner → a, the
5. Noun → dog, cat
6. Verb → chased

Figure 13.1 A simple phrase-structure grammar.

Phrase-Structure Grammar 

The study of sentences properly begins with the study of grammar (see Berwick & 
Weinberg, 1983). A grammar is a set of rules defining legal and illegal sentences. 
Just as there are rules that govern how phonemes and morphemes can be combined
to make words, so, too, are there rules that dictate how words can be
combined into true or acceptable sentences. The purpose of a grammar is to 
specify what strings of words do and do not correspond to legal sentences: “Bob 
kissed Mary” is a legal sentence, but “Alphabet purple run” is not. A sentence 
must have certain kinds of words in certain orders to pass the test of a true 
sentence. 

Many grammars are hierarchical. In them a sentence is first broken up into 
several phrases, or small groups of words centered on nouns or verbs. These
phrases, in turn, are broken down further (see Figure 13.1). Grammars that work 
in this manner are called phrase-structure grammars. Figure 13.1 shows how the 
sentence “The dog chased the cat” is parsed into a tree structure, or is analyzed 
for its constituent components. 

Actually, the grammar itself is not the tree structure in the upper portion of 
Figure 13.1; rather, it is the list of rewrite rules at the bottom of the figure. These 
rules describe how parts of the sentence can be rewritten as being composed of 
even smaller parts. For example, Rule 1 states that a sentence can be rewritten as 
a noun phrase followed by a verb phrase. This corresponds to the top-level 
breakdown in the tree structure. Similarly, Rule 2 states that a noun phrase can be 
rewritten as, or is composed of, a determiner (a or the) followed by a noun. Rule
3 states that a verb phrase can be rewritten as, or is composed of, a verb followed by
a noun phrase. This noun phrase, contained within the verb phrase, can also be 
broken down into a determiner and a noun, just like any noun phrase. 
Rules 4-6 essentially specify the vocabulary of this very limited grammar that
we are considering. (A “real-life” grammar would obviously have a lot more
substance than this.) Legal determiners in this limited grammar are a and the,
legal nouns are dog and cat, and the only legal verb in this grammar is chased.
The combination of the structure rules (1-3) and the vocabulary rules (4-6)
allows for a considerable number of legal sentences (for example, “A dog chased a
cat,” “The cat chased a dog,” and so on). Strings of words that do not follow these
rules (for example, “A cat followed a dog” or “Chased cat dog a the”) are not legal,
according to this limited grammar.
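
To see how few rules are needed to produce a set of legal sentences, the six rewrite rules can be expanded mechanically. What follows is a minimal sketch, not a model of how people produce sentences. The dictionary layout and the names RULES, generate, and is_legal are assumptions made for this illustration; only the rules themselves come from the text.

```python
from itertools import product

# Rewrite rules 1-6 of the limited grammar. Each symbol maps to a list of
# possible expansions; anything that is not a key is an actual word.
RULES = {
    "Sentence": [["NounPhrase", "VerbPhrase"]],   # Rule 1
    "NounPhrase": [["Determiner", "Noun"]],       # Rule 2
    "VerbPhrase": [["Verb", "NounPhrase"]],       # Rule 3
    "Determiner": [["a"], ["the"]],               # Rule 4
    "Noun": [["dog"], ["cat"]],                   # Rule 5
    "Verb": [["chased"]],                         # Rule 6
}


def generate(symbol):
    """Return every word sequence that the symbol can be rewritten into."""
    if symbol not in RULES:
        return [[symbol]]  # a word rewrites only to itself
    sequences = []
    for expansion in RULES[symbol]:
        # Expand each part, then combine the alternatives in order.
        parts = [generate(child) for child in expansion]
        for combination in product(*parts):
            sequences.append([word for part in combination for word in part])
    return sequences


LEGAL_SENTENCES = {" ".join(words) for words in generate("Sentence")}


def is_legal(sentence):
    """Check a string against the sentences this grammar can generate."""
    return sentence.lower().rstrip(".") in LEGAL_SENTENCES


print(len(LEGAL_SENTENCES))
print(is_legal("A dog chased a cat"), is_legal("A cat followed a dog"))
```

Listing every sentence works only because this grammar is tiny and has no rule that refers back to itself. A grammar for a real language would need a parser rather than a lookup.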

The task of the linguist is to develop a grammar, or list of rules, that is capable 
of generating all legal sentences in a given language while not generating any 
illegal ones—quite a task, to say the least. 

Psychological Reality of Phrase-Structure Grammar 
What kind of evidence is there to support the idea that a phrase-structure analysis 
of the sentence is reasonable and correct? In other words, even though we can 
break sentences down into phrases on a logical level, how do we know that 
phrases are psychologically important to language users? Maybe the whole 
phrase analysis is little more than a logical exercise, and perhaps it has little to do 
with how we go about generating and understanding sentences. 

CLICK MIGRATION. There are at least two kinds of experiment suggesting that 
phrases are important and significant to us when we construct and try to understand
sentences. In the first type of experiment (click migration), Fodor and Bever
(1965) superimposed auditory clicks over a tape recording of a sentence. Sometimes
times the clicks were placed in the middle of a phrase, and sometimes, at the 
beginning or end of a phrase. Both the clicks and the sentence could be heard 
clearly. Subjects listened to the recording and were asked to indicate at what 
points in the sentence the clicks had occurred. Many of the clicks that were placed 
in the middle of a phrase were erroneously reported as having occurred at the end 
of the phrase; the clicks “migrated.” Whereas it is difficult to tell exactly what this 
effect means, it seems to suggest that the subjects were, in fact, processing the 
sentence in terms of phrases. Comprehending a given phrase took precedence 
over the click-perception task, so that perception of the click was in some sense 
delayed until the subject had finished processing that phrase. In a sense, the 
subject was too busy nailing down that phrase to pay attention to the click (see 
also Garrett, Bever, & Fodor, 1966). 

There are some limits on the extent of click migration. For instance, Seitz and 
Weber (1974) found that the effect can be reduced if subjects are allowed to read 
the sentence along with its auditory presentation (see also Reber, 1973). The 
major point to be made, however, is that during ordinary sentence comprehension 
special attention does seem to be paid to major phrases. Trying to understand 
language processing in terms of phrases does seem to make some sense. 

ERROR PROBABILITIES. The second kind of experiment that supports the phrase 
analysis of sentences has to do with transitional error probabilities. Johnson 
(1965) had subjects memorize and later recall sentences such as that in Figure
13.2. He then compared the phrase structure of the sentence with the probabilities
of incorrectly recalling the words from the sentence. The numbers below the
sentence show the transitional error probabilities. For instance, given that tall
was recalled, the probability of making an error in recalling boy was .05. It is
clear that high error rates are found when moving from the recall of one phrase to
the next; the average error rate within the noun and verb phrases is .06, compared
with the rate of .12 when crossing from the noun phrase to the verb phrase.
The click-migration and transitional-error-probability studies argue strongly for
the psychological validity of phrases.

Figure 13.2 The phrase structure of a sentence and the corresponding transitional error
probabilities obtained by Johnson. (Adapted from Johnson, N. F. The psychological
reality of phrase-structure rules. Journal of Verbal Learning and Verbal Behavior, 1965,
4, 469-475. Fig. 1, p. 471, and Table 2, p. 472.)
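
A transitional error probability can be computed directly from recall records. The sketch below assumes each subject's recall is scored word by word as correct or incorrect. The function name and the four made-up records are hypothetical and are not Johnson's data.

```python
def transitional_error_probabilities(recalls):
    """Compute one transitional error probability per word-to-word transition.

    recalls is a list of trials; each trial is a list of booleans, one per
    word in the sentence (True means that word was recalled correctly).
    For each transition, only trials in which the earlier word was correct
    are counted, and the result is the proportion of those trials in which
    the next word was wrong.
    """
    word_count = len(recalls[0])
    probabilities = []
    for position in range(word_count - 1):
        eligible = [trial for trial in recalls if trial[position]]
        errors = sum(1 for trial in eligible if not trial[position + 1])
        if eligible:
            probabilities.append(errors / len(eligible))
        else:
            probabilities.append(float("nan"))  # no trial qualifies
    return probabilities


# Hypothetical records for a three-word string (illustration only).
made_up_recalls = [
    [True, True, True],
    [True, False, True],
    [True, True, False],
    [False, True, True],
]
print(transitional_error_probabilities(made_up_recalls))
```

With real data, the values for transitions that cross a phrase boundary would be compared with those for transitions inside a phrase, as in the comparison of .12 with .06 reported above.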

Problems with Phrase-Structure Grammars 

Phrase-structure grammars provide useful, convenient ways to think about sentences.
And yet there is far more to language than can be accounted for by these
grammars. They are fine as far as they go, but they do not go far enough. Phrase 
grammars have at least two significant flaws. 

FLAW 1. The first flaw has to do with the fact that some sentences can have two
different meanings. Consider the sentence “They are flying planes.” The two
possible structures for this sentence are shown in Figure 13.3. This is what is
called an ambiguous sentence (see Reardon & Katz, 1983). The ambiguity in the
meaning of the sentence comes from the fact that they can refer either to the
planes or to the people who might be flying them. Thus, one version states that
there are people who are in the process of flying planes, whereas the other version
refers to planes that are meant for flying as opposed to, say, model planes. “They
are eating pears” is another ambiguous sentence.

Figure 13.3 The two different structures for “They are flying planes.”

The correct parsing of these sentences clearly depends on which meaning is 
intended. Since phrase-structure grammars deal only with the words in the 
sentence and not at all with what the speaker intended or meant by the sentence, 
they do not adequately handle these sentences. We need to know more about 
what the language user means, or intends to convey, before we can completely 
understand language. 
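
The ambiguity can be demonstrated mechanically: a phrase-structure grammar that allows both readings gives two trees for the same string and offers no basis for choosing between them. The sketch below assumes a toy grammar written for this one sentence. The category names (Aux, Copula, Modifier) and the functions parse, expand, and all_parses are invented for the illustration and are not taken from Figure 13.3.

```python
# Toy rewrite rules that allow both readings of the sentence.
GRAMMAR = {
    "S": [["Pronoun", "VP"]],
    "VP": [["Aux", "Verb", "NP"],   # are + flying + planes (people who fly)
           ["Copula", "NP"]],       # are + flying planes (a kind of plane)
    "NP": [["Noun"], ["Modifier", "Noun"]],
}

# The same word may belong to more than one category.
LEXICON = {
    "Pronoun": {"they"},
    "Aux": {"are"},
    "Copula": {"are"},
    "Verb": {"flying"},
    "Modifier": {"flying"},
    "Noun": {"planes"},
}


def parse(symbol, words, start):
    """Yield (tree, next_position) for each way symbol can begin at start."""
    if symbol in LEXICON:
        if start < len(words) and words[start] in LEXICON[symbol]:
            yield (symbol, words[start]), start + 1
        return
    for expansion in GRAMMAR[symbol]:
        yield from expand(symbol, expansion, words, start)


def expand(symbol, expansion, words, start):
    """Match the parts of one expansion left to right, keeping every option."""
    partial = [([], start)]
    for child in expansion:
        partial = [(subtrees + [tree], next_position)
                   for subtrees, position in partial
                   for tree, next_position in parse(child, words, position)]
    for subtrees, position in partial:
        yield (symbol, *subtrees), position


def all_parses(sentence):
    """Return every tree that covers the whole sentence."""
    words = sentence.lower().rstrip(".").split()
    return [tree for tree, end in parse("S", words, 0) if end == len(words)]


for tree in all_parses("They are flying planes"):
    print(tree)
```

Both trees are equally legal as far as the rules are concerned, which is exactly the difficulty: nothing in the grammar records which meaning the speaker had in mind.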

FLAW 2. The second flaw in phrase-structure grammars is that they do not 
capture the fact that there are many sentences with different structures that 
express the same idea. For example, consider the active—passive distinction. Read 
the following sentences: 

1. The dog chased the cat. 

2. The cat was chased by the dog. 

3. The cat chased the dog. 

Now let’s simply look at the surface structure, or the word orders, in these 
sentences, as does phrase-structure grammar. Sentences 1 and 3 are quite similar 
physically. In fact, they are much more similar physically than are Sentences 1 and 
2. And yet 1 and 2 mean the same thing, whereas 1 and 3 do not. If we restricted 
our attention to these superficial, surface characteristics, such as word order, we 
would make errors in our understanding of the nature of sentences. Phrase-structure
grammar, with its emphasis on superficial physical characteristics, is not
enough to understand complex language usage. 

Surface and Deep Structure 

In attempting to overcome the inability of phrase-structure grammar to account 
for these problems, Noam Chomsky (1957, 1965) introduced the important 
distinction between deep and surface structure. He argued that a sound theory of
sentence structure must be based not on superficial, surface characteristics, such 
as word order, but on the underlying idea intended by the user of the sentence. 
Although this is somewhat of an oversimplification, the difference between surface
and deep structure is the difference between wording and the basic meaning
underlying that particular wording. The same deep structure (the basic structure 
of the idea) can be expressed in different ways, or involve different surface 
structures. 


Verbatim Versus Meaning Memory 

One obvious question related to this distinction between surface and deep structure,
or wording and basic idea, is this: Which is more easily remembered, the
wording or the basic idea of a sentence? Sachs (1967, 1974) has done some 
interesting experiments that suggest an answer here. Sachs’ subjects read short 
paragraphs that contained “target sentences.” For example, a paragraph concerning
the discovery of the telescope might contain the sentence “He sent a letter
about it to Galileo, the great Italian scientist.” The subjects were then shown a set 
of sentences and asked to choose the one that was the same as the embedded one. 
They were given the following choices: 

1. He sent a letter about it to Galileo, the great Italian scientist. (same wording)

2. He sent Galileo, the great Italian scientist, a letter about it. (different word
order, same meaning)

3. A letter about it was sent to Galileo, the great Italian scientist. (passive
transformation, same meaning)

4. Galileo, the great Italian scientist, sent him a letter about it. (different 
meaning) 

Notice that Sentences 1, 2, and 3 all have roughly the same meaning. Sentence 4, 
however, has a very different meaning from the other three. The results of the 
experiment were quite clear. When the target sentence was embedded in the 
middle of the paragraph, the subjects could identify the correct sentence at a rate 
only slightly better than chance. They were very likely to pick one of the paraphrases
(2 or 3). But the subjects were almost always able to reject the sentence
with the different meaning (4). In other words, although they were not very good 
at remembering the exact wording of the sentence, they were very good at 
remembering the basic idea. 

It is not so much that we cannot remember the wording. We choose, or prefer, 
to remember the basic idea. This was demonstrated by Wanner (1974), who
showed that, when subjects are instructed to concentrate on the wording of a
sentence, wording identification can be quite accurate (see also Keenan,
MacWhinney, & Mayhew, 1977).

Transformational Grammar 

Chomsky, in pursuing this distinction, introduced the idea of transformational 
grammar. According to this notion, deep-structure ideas that people have can be 
expressed as sentences by applying various transformations to the deep-structure 
content. In other words, by applying different transformations to the deep-structure
content we can develop different surface structures (different orders of
words) that express the same idea. 

EXAMPLES OF TRANSFORMATION. Consider the sentence “The dog chased the 
cat.” This is a direct expression of the underlying idea, or deep structure; it is the 
idea we have in mind and is expressed in the active voice. If we wished to express
this underlying idea in a passive form, we could apply a transformation, as
follows:

1. Exchange the position of the two noun phrases. 

2. Replace the verb with the phrase “was chased by.” 

If we do this, we have transformed the active “The dog chased the cat” to the 
passive “The cat was chased by the dog.” In performing this transformation (steps 
1 and 2 above) we have altered the surface structure (the order of the words and 
the exact words we used), but we have not changed the deep structure, or the 
underlying idea: “The dog chased the cat.” 

Several transformations can be performed in succession, leading to a wide
variety of sentences all expressing pretty much the same underlying deep structure:

1. “The cat was not chased by the dog.” (passive negation)

2. “Was the cat chased by the dog?” (passive question)

3. “Was the cat not chased by the dog?” (passive negative question)

All of these sentences are derived from the same basic, underlying idea. 
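
The idea that transformations are applied one after another to a single underlying idea can be illustrated with a few small functions. This is only a sketch: the clause representation and the names passive, negate, question, and render are assumptions made for the example, and the code covers nothing beyond the sentences discussed above. It is not Chomsky's formal system.

```python
def passive(agent, participle, patient):
    """Apply the passive transformation to a deep structure.

    Step 1: the two noun phrases exchange positions.
    Step 2: the verb is replaced by "was <participle> by".
    """
    return {
        "subject": patient,
        "aux": "was",
        "negated": False,
        "predicate": f"{participle} by {agent}",
        "question": False,
    }


def negate(clause):
    """Negation transformation: mark the clause so "not" is added."""
    return {**clause, "negated": True}


def question(clause):
    """Question transformation: mark the clause so the auxiliary moves first."""
    return {**clause, "question": True}


def render(clause):
    """Spell out the surface structure (the actual word order)."""
    negation = ["not"] if clause["negated"] else []
    if clause["question"]:
        words = [clause["aux"], clause["subject"], *negation, clause["predicate"]]
        end = "?"
    else:
        words = [clause["subject"], clause["aux"], *negation, clause["predicate"]]
        end = "."
    text = " ".join(words)
    return text[0].upper() + text[1:] + end


base = passive("the dog", "chased", "the cat")
print(render(base))
print(render(negate(base)))
print(render(question(base)))
print(render(question(negate(base))))
```

Each printed sentence has a different surface structure, yet all four are built from the same three elements: the dog, chased, and the cat.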

The problems posed by ambiguous sentences, such as “They are flying planes,” 
can be handled by transformational grammars, because these grammars do take 
deep structure into account, as phrase-structure grammars do not. Transformational
grammars do consider the two different deep structures underlying the two
interpretations of the ambiguous sentence. The two different deep structures are 
expressed by a common surface structure only because the application of transformations
to their different deep structures happens to lead to the same sentence.

MIXED LABORATORY RESULTS. Chomsky’s idea about transformational grammar
led to a prediction that was testable in the laboratory. Specifically, as the
number of transformations required to convert a deep structure into a particular 
surface structure increases, that surface structure should be more difficult to 
comprehend and remember. The more transformations required, the greater the 
complexity of the surface structure and the more difficult the tasks of comprehension
and memory. For example, “The cheese was eaten by the mouse” should be
more easily remembered than “Was the cheese not eaten by the mouse?” because 
the first sentence involves fewer transformations from the deep structure (“The 
mouse ate the cheese”) than does the second sentence. 
One technique that can be used to study this hypothesis is called the sentence-verification
technique (see Carpenter & Just, 1975; Clark & Chase, 1972; Trabasso,
Rollins, & Shaughnessy, 1971). In this method subjects are shown a simple
picture, such as a set of red dots, and a sentence that might describe the picture,
such as “The dots are red” or “Red is the color of the dots.” The subjects’ task is
to judge whether the sentence is an accurate description of the picture and to push 
a yes or no button as quickly as possible. The sentences presented to the subjects 
varied in derivational complexity and in whether they accurately described the 
pictures. By varying the complexity of the sentences one can determine whether 
complexity affects comprehension as revealed by reaction time. 

The early results looked very promising. A number of investigators reported 
results suggesting that memory for surface structures involving more transformations
would be poor (see Savin & Perchonock, 1965; Wason, 1961). This rosy
state of affairs did not last long, however. Since those early studies, a number of 
investigators have reported findings that contradict the idea that number of 
transformations should be correlated with difficulty in memory (see Fodor, Bever, 
& Garrett, 1974). 

Consider, for example, the following sentences: (1) “John is happier than 
Mary” and (2) “John is happier than Mary is happy.” Sentence 1 is easier to 
understand and remember than Sentence 2. And yet Sentence 1 is derived from 
Sentence 2 by a deletion transformation. In other words, these sentences contradict
the idea that greater numbers of transformations lead to greater complexity
and more difficult comprehension.

Thus, Chomsky’s ideas about transformational grammar, although revolu¬ 
tionizing the field of linguistics, did not provide all the answers. Psychologists, 
while maintaining many of Chomsky’s notions, have moved on to new concerns. 
It is to some of these more recent developments that we now turn. 

HAM and ACT 

Recent years have witnessed an upsurge in theories and computer models having 
to do with the representation of meaning in memory (see Anderson & Bower, 
1973; Cullingford, 1978; Hayes-Roth & Hayes-Roth, 1977; Hayes-Roth & 
Thorndyke, 1979; Kintsch, 1974; Norman & Rumelhart, 1975; Schank, 1975; 
Schank & Abelson, 1977; Thorndyke & Bower, 1974). Because these models and
experiments are complex and perhaps better suited for study in a graduate-level 
course, we will concentrate on one of the better-known ones. The model we have 
chosen is called HAM, for human associative memory. A newer version of HAM 
has been called ACT (see Anderson, 1976, 1983). If you will recall, we discussed 
Anderson’s ACT theory, which is a mathematically oriented, all-encompassing 
spreading-activation theory of memory, in Chapter 9. For our purposes here a 
discussion of the simpler HAM version will be sufficient to provide the flavor of 
this type of theorizing. 

PROPOSITIONS. HAM is a model of how information is encoded, stored, and 
retrieved. Specific to our concerns here is the fact that HAM stores information 
about the world as propositions. A proposition is the meaning behind words. For 
example, “Leaves are green” is not a proposition. The proposition is the abstract 
idea expressed by these three words; the same proposition could also be expressed 
by a picture of a green leaf. A proposition is an abstract idea or assertion about
the world that can be expressed in different ways. In a sense a proposition is an 
assertion about the world that may be true or false. For example, a picture of a red, 
white, and blue leaf or the statement “Leaves are red, white, and blue” both 
express the same underlying (and incorrect) proposition. It is difficult to grasp the 
nature of a proposition, but for our purposes it is best thought of as the meaning 
behind an expression or assertion, regardless of the form of that expression. 
HAM uses propositions, rather than words or pictures, as the units that are 
stored.

INPUT TREES. Let us consider how HAM stores complex propositions in mem¬ 
ory. Consider the sentence “In the park the hippie touched the debutante.” There 
is a complex meaning, or proposition, behind this string of words. HAM stores 
this proposition in what is called an input tree (see Figure 13.4). The contents of 
Figure 13.4 represent how HAM stores a complex proposition. Let us look at this 
stored input tree in detail. Begin at the top. Notice that the first branching refers 
to a distinction between context and fact. In our proposition something happened 
(the hippie touched the debutante) in a certain context (in the park and in the 
past). The context of the event is broken down further into location (the park) 
and time (the past). The fact (the hippie touched the debutante) is also further 
broken down, into subject (the hippie) and predicate (touched the debutante). 
Finally, the predicate is broken down into a relation (touched) and an object (the 
debutante). The argument that the authors of HAM wish to make is that a 
representation of this complex input tree is what is stored in memory. Notice that 
the sentence “In the park the hippie touched the debutante” is not stored. What is 
stored is the abstract meaning of this sentence, and it is stored as a structure of 
elements that bear particular relationships to one another. This is really just a way 
to think about what is stored in our memory when someone reads us the sentence 
“In the park the hippie touched the debutante.” We hear the sentence and store its 
meaning as an abstract proposition expressed by the input tree, rather than in 
terms of the actual words contained in the sentence. 
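
For readers who find a concrete data structure helpful, the input tree just described can be sketched in Python. This is a minimal illustration of the branching described in the text, not Anderson and Bower's own program; the class name Node and the helper leaves are assumptions introduced here.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    label: str                    # role of the node, e.g. "context" or "subject"
    word: Optional[str] = None    # concept attached to the node (leaves only)
    children: List["Node"] = field(default_factory=list)


def leaves(node):
    """Return (role, word) pairs for every leaf, reading left to right."""
    if not node.children:
        return [(node.label, node.word)]
    pairs = []
    for child in node.children:
        pairs.extend(leaves(child))
    return pairs


# "In the park the hippie touched the debutante."
input_tree = Node("proposition", children=[
    Node("context", children=[
        Node("location", "park"),
        Node("time", "past"),
    ]),
    Node("fact", children=[
        Node("subject", "hippie"),
        Node("predicate", children=[
            Node("relation", "touched"),
            Node("object", "debutante"),
        ]),
    ]),
])

for role, word in leaves(input_tree):
    print(role, "->", word)
```

Note that the word order of the sentence appears nowhere in the structure; only the roles and the relationships among them are represented.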

INCREMENTAL CUING. Now, given that the HAM authors believe that this is a 
good way to think about how a sentence is stored, what evidence is there for or 
against such a conception? Most of Anderson and Bower’s (1973) experiments are 
very complex and require mathematical sophistication. But some of them can be 
described here. In one experiment, for example, subjects first studied 72 sen¬ 
tences. Each sentence contained four elements: a location (such as park), an agent 
(such as hippie), a verb (such as touched), and an object (such as debutante). The 
subjects were then tested for their ability to remember these four elements in each
of the 72 sentences by an incremental cuing method. In this method the subjects 
first were given one of the four critical elements in a given sentence and asked to 
recall the remaining three. Then they were given two of the four critical words, 
including the one first presented, and asked to recall the remaining two. Finally, 
the subjects were given three of the critical words and asked to recall the 
remaining word. 

For example, on the first test the subjects might be given “In the ____ the
hippie ____ the ____.” On the second test they might be given “In the ____
the hippie touched the ____.” And so on.

Figure 13.4 Input tree representing the way that the sentence “In the park the hippie
touched the debutante” is stored according to HAM. (Adapted from Anderson, J. R., and
Bower, G. H. Human associative memory. Washington, D.C.: Winston, 1973. Figure on
p. 145.)

By using this incremental-cuing technique we can test some predictions made 
by the input tree in Figure 13.4. To retrieve information from the stored tree it is 
assumed by HAM that the subjects start at the bottom with what is given to 
them (for example, the word park or some other word or set of words) and search 
back up from the bottom of the tree, following all the links that have been firmly 
stored. The subjects follow along all existing links, recalling and reporting the 
words found there, until a dead end (a broken or nonestablished link) is encoun¬ 
tered, or the total set of words is recalled correctly. 

For example, assume that a subject is given park and can recall only hippie.
This means that all the links between park and hippie must be intact. Now, why
could the subject not recall touched and debutante? Several things could account
for these failures. For instance, the predicate link might be broken or weak,
whereas the relation and object links are strong, as depicted in Figure 13.5. If a
weak predicate link is causing the absence of touched and debutante, then a
second cuing, where both park and touched are presented, should reveal this fact.
In this case, recall of debutante should occur along with recall of hippie. However,
if the predicate link is strong and the relation and object links are absent, then
second-level cuing, where both park and touched are presented, would not lead
to recall of debutante (see Figure 13.6).
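
The two cases can be worked through with a small simulation. The sketch below is a simplification made for illustration and is not HAM itself: each link is treated as either intact or broken, and retrieval simply follows intact links outward from the cue words. The names LINKS, CRITICAL, and recall are hypothetical.

```python
from collections import deque

# Each link is named for the lower node it attaches: name -> (upper, lower).
LINKS = {
    "context":   ("proposition", "context"),
    "fact":      ("proposition", "fact"),
    "location":  ("context", "park"),
    "time":      ("context", "past"),
    "subject":   ("fact", "hippie"),
    "predicate": ("fact", "predicate"),
    "relation":  ("predicate", "touched"),
    "object":    ("predicate", "debutante"),
}

# The four elements that were tested in the incremental-cuing experiment.
CRITICAL = {"park", "hippie", "touched", "debutante"}


def recall(cues, broken=()):
    """Return the critical words reachable from the cues over intact links."""
    neighbors = {}
    for name, (upper, lower) in LINKS.items():
        if name in broken:
            continue  # a broken link is a dead end and cannot be followed
        neighbors.setdefault(upper, set()).add(lower)
        neighbors.setdefault(lower, set()).add(upper)

    seen = set(cues)
    queue = deque(cues)
    while queue:
        node = queue.popleft()
        for nxt in neighbors.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return (seen & CRITICAL) - set(cues)


# Predicate link missing (the case shown in Figure 13.5).
print(recall(["park"], broken={"predicate"}))
print(recall(["park", "touched"], broken={"predicate"}))

# Relation and object links missing (the case shown in Figure 13.6).
print(recall(["park", "touched"], broken={"relation", "object"}))
```

With the predicate link broken, adding touched as a second cue brings in debutante; with the relation and object links broken, the second cue adds nothing. This is the pattern of predictions described in the text.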

Without going into any greater detail it can be seen how intricate predictions
and tests of these predictions can be made with HAM. In general, Anderson and
Bower (1973) are able to present quite a number of data consistent with their
model. As is so often true in psychological theorizing, however, there are also data
that seem to contradict the theory (see Foss & Harwood, 1975). Regardless of
what eventually happens to it, HAM characterizes the kind of complex model
building that is going on in this area (see also Anderson, 1983).

Figure 13.5 HAM input tree assuming that predicate link is missing. In this case the
presentation of park alone will not elicit debutante, but the presentation of touch along
with park should elicit debutante.

PROSE 

Thus far we have discussed research and theorizing that is centered on words and 
sentences. But there is obviously more to language than these elements. This 
section describes some of the thinking that has been done about prose, or textual 
material (see Waters, 1983). 



Figure 13.6 HAM input tree assuming that relation and object are missing. In this case
the presentation of park alone will not elicit debutante, but neither will the presentation
of park and touch together.

Context Sentences 

Let us begin on a simple level, where we consider only a couple of sentences at a 
time. One of the first things investigators in this area found was that our 
understanding of a given sentence can be strongly affected by the sentences that 
surround it in a piece of prose. In other words, sentences do not occur in a 
vacuum. They exist in a context, and that influential context is composed of the
sentences that surround them. 

DIFFICULTY IN COMPREHENSION. The sentence context surrounding a given 
sentence can affect (1) the ease with which we understand that target sentence and 
(2) the actual nature of our understanding of that sentence. Let us first look at 
some examples of how the difficulty of understanding a given sentence can be 
affected by surrounding sentences. 

Haviland and Clark (1974) and Clark and Haviland (1977) found that the 
second of the following sentences was easily understood: “John took the beer out 
of the truck. The beer was warm.” But Haviland and Clark also demonstrated 
that, by making a very simple change in the first sentence, the time it took subjects 
to understand the second sentence could be increased dramatically: “John took 
the picnic supplies out of the truck. The beer was warm.” In both instances the 
sentence to be understood (“The beer was warm”) was the same, but depending 
on the context it was easy or difficult to understand. 

The cause of the increased comprehension time is clear. When reading the latter 
pair of sentences, it is necessary to infer that beer refers to the picnic supplies 
mentioned in the first sentence. The reader’s store of knowledge must be searched 
to verify the fact that beer is a reasonable example of picnic supplies, and this 
extra processing slows the subject up and delays the response. 

Haviland and Clark have demonstrated that the key to this phenomenon is the 
assumption that a reference to beer has already been made in the first sentence. 
This has been referred to as the Given-New principle. The Given-New principle
states that in connected discourse each sentence refers to something in a previous 
sentence (beer) and also adds something new (warm). When the connection 
between the sentences is explicit, as in the first pair of sentences, comprehension is 
easy. But when an obvious connection cannot be found, as in the second pair of 
sentences, the subject must work to find the connection. 

Here is another example of how comprehension can be affected by surrounding 
sentences: “John liked Mary’s bike. He took it.” In this pair the second sentence is 
easily understood. But consider “John liked Bill’s bike. He took it.” Here the 
second sentence is difficult to understand. By changing one word in the first 
sentence the second sentence can be made quite ambiguous (see also Bransford &
Johnson, 1972; Dooling & Lachman, 1971; Gerrig & Healy, 1983; Keenan &
Kintsch, 1974).

NATURE OF COMPREHENSION. Not only can the sentences surrounding a given 
sentence affect how rapidly that target sentence is understood, but the contextual 
sentence can influence what is actually understood as well. For example, Thorndyke
(1976) had subjects read, “The hamburger chain owner was afraid his love
of french fries would ruin his marriage.” On the surface, the sentence does not 
make sense. However, if this sentence is accompanied by “The hamburger chain 
owner decided to join Weight-Watchers in order to save his marriage,” our
understanding of the sentence will change dramatically. 

SEMANTIC INTEGRATION. Another phenomenon that has been discovered is that 
of semantic integration. This refers to the fact that, when we read two or more 
sentences, we are likely to combine them, or integrate them, in such a way that 
our understanding of the situation draws on both sentences. For example, Hayes-Roth
and Thorndyke (1979) had subjects read two separate stories. One story
contained the sentence “King Egbert was a dictator.” The second story contained 
“Albert hated all dictators.” Hayes-Roth and Thorndyke found that the subjects 
would integrate these two sentences and draw the inference that “Albert hated 
King Egbert.” In other words, they filled out their understanding of the situation 
by putting the information from two separate sentences together (see also Bransford
& Franks, 1971).


Longer Strings: Verbal Context 

Most of the studies cited above are concerned with one or two sentences. In a 
continuing effort to study phenomena that more and more closely approximate 
the real world, however, psychologists have begun to study longer and longer 
segments of connected discourse. Context effects appear in longer texts as well as 
in the case of pairs of sentences. In other words, in a narrative, or story, the 
meaning that we derive from any given sentence and from the narrative as a 
whole is strongly affected by the nature of all of the sentences in the narrative. 
Consider the following passage. What is it about? 

The procedure is actually quite simple. First, you arrange things into different 
groups. Of course, one pile may be sufficient depending on how much there is 
to do. If you have to go somewhere else due to lack of facilities, that is the 
next step, otherwise, you are pretty well set. It is important not to overdo 
things. That is, it is better to do a few things at once than too many. In the 
short run this may not seem too important but complications can easily arise. 
A mistake can be expensive as well. At first the whole process will seem 
complicated. Soon, however, it will become just another facet of life. It is 
difficult to foresee any end to the necessity for this task in the immediate 
future, but then one never can tell. After the procedure is completed one 
arranges the materials into different groups again. Then they can be put in 
their appropriate places. Eventually they will be used once more and the 
whole cycle will then have to be repeated. However, this is part of life. 
(Bransford & Johnson, 1972, p. 322) 

It is difficult to say what this passage is all about, because the sentences are so 
vague and ambiguous that they give you little help in your efforts to understand 
the other sentences. In other words, the context provided by all of these sentences 
is weak and unhelpful. Now substitute the words pile of clothing for pile and
the words washing machine for facilities, and read through the passage again.
The sentences will make sense this time, because we have enriched the context a 
bit. Now sentences within the passage will help you to understand the other 
sentences. 
Visual Context

Our understanding of narrative material is strongly affected by the verbal context 
in which it is located, as the example above illustrates. But our comprehension of 
a passage of verbal material can be strongly affected by visual contextual factors 
as well. Bransford and Johnson (1972) had three groups of subjects read the 
following passage and then attempt to recall as many of the ideas in it as they 
could. Before proceeding, read through the passage and attempt to recall as much 
of it as you can. 

If the balloons popped, the sound wouldn’t be able to carry, since everything 
would be too far away from the correct floor. A closed window would also 
prevent the sound from carrying, since most buildings tend to be well 
insulated. Since the whole operation depends on a steady flow of electricity, 
a break in the middle of the wire would also cause problems. Of course, the 
fellow could shout, but the human voice is not loud enough to carry that far. 
An additional problem is that a string could break on the instrument. Then 
there could be no accompaniment to the message. It is clear that the best 
situation would involve less distance. Then there would be fewer potential 
problems. With face to face contact, the least number of things could go 
wrong. (p. 392)

It’s not easy, is it? In fact, one of the three groups in the experiment did what
you just did and was able to recall an average of only 3.6 ideas from a maximum 
of 14. The other two groups were treated differently. One of them was shown the 
picture in Figure 13.7 before it read the passage. The third group was shown the 
picture immediately after it read the passage. In essence, this picture provides a 
visual context in which to try to understand the passage. 

The group that saw the picture before reading the passage benefited greatly; it 
recalled an average of 8 ideas. But, interestingly, the people who saw the picture 
after reading the passage did not benefit; their mean recall was the same as the 
group not seeing the picture at all (3.6). The picture did not just provide helpful 
hints or reminders about what was in the passage. If it had, the group seeing the 
picture after reading the passage would have been helped. This suggests that the 
picture actually improved comprehension in the group seeing it first, which, in
turn, improved recall.

The situation is not a simple one, because additional research (Anderson &
Pickert, 1978) has shown that under some circumstances contextual information 
presented following the reading of a passage can improve recall. In other words, it 
is clear that contextual information, both verbal and visual, can have a strong 
impact on our comprehension of passages, but the intricate details of the effect 
are not yet clear (see also Brewer & Dupree, 1983). 

The Structure of Stories 

Just as psychologists have tried to understand the structure of sentences by 
building grammars and trees, they have also tried to analyze simple stories. For 
example, Kintsch, Kozminsky, Streby, McKoon, and Keenan (1975) developed 
short stories, or texts, similar to the following: “The Greeks loved beautiful art.
When the Romans conquered the Greeks, they copied them, and, thus, learned to
create beautiful art.”

Figure 13.7 Appropriate context for the balloon passage. [From Bransford, J. D., and
Johnson, M. K. Considerations of some problems of comprehension. In W. G. Chase
(Ed.), Visual information processing. Copyright 1973 by Academic Press, Inc. Reprinted
by permission.]

One way to think about the structure of this story is to break it down into a list 
of propositions, as follows: 

1. (love, Greek, art) 

2. (beautiful, art) 

3. (conquer, Roman, Greek) 

4. (copy, Roman, Greek) 

5. (when, 3, 4) 

6. (learn, Roman, 8) 

7. (consequence, 3, 6) 

8. (create, Roman, 2) 

Remember that propositions are shorthand ways of noting abstract ideas. 
Thus, (conquer, Roman, Greek) refers to the abstract idea, in all its complexity,
that the Romans conquered the Greeks. 
Figure 13.8 Hierarchical text structure of the Greek passage. Propositions that are related
to one another, in that they contain one or more common elements, are connected by a
line. (Adapted from Kintsch, W. Memory and cognition. New York: Wiley, 1977. Figure
6.9, p. 359. Copyright © 1977.)

Kintsch (1974) points out that to describe this story as a list of propositions is 
all well and good, but it is not enough because it does not give us a very good idea 
about which propositions are interconnected, or which propositions are related to 
one another. To grasp the overall organization of a coherent story, we must have 
some way of expressing how these listed propositions are related to one another. 
To do this, Kintsch (1977) developed the hierarchical text structure contained in 
Figure 13.8. The basis for this hierarchy is that two propositions are related if 
they contain the same element, or elements. Thus, Proposition 2 is related to 
Proposition 1 because both propositions contain the element art. Similarly, Prop¬ 
osition 4 is related to Proposition 1 because both contain the element Greek. The 
lines drawn between two circles in Figure 13.8 indicate that the connected 
propositions contain common elements. Sometimes one proposition contains all 
of the elements contained in another proposition (for example, Proposition 8
contains all of Proposition 2).
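
The shared-element rule lends itself to a short sketch in Python. The code below is an illustration under stated assumptions and does not reproduce Figure 13.8: it compares only the arguments of each proposition (everything after the first term), it counts an embedded proposition as a shared element, and the names PROPOSITIONS, arguments, and related are hypothetical.

```python
# The eight propositions of the Greek passage, numbered as in the list above.
# An integer argument refers to another proposition by its number.
PROPOSITIONS = {
    1: ("love", "Greek", "art"),
    2: ("beautiful", "art"),
    3: ("conquer", "Roman", "Greek"),
    4: ("copy", "Roman", "Greek"),
    5: ("when", 3, 4),
    6: ("learn", "Roman", 8),
    7: ("consequence", 3, 6),
    8: ("create", "Roman", 2),
}


def arguments(number):
    """Arguments of a proposition: everything after its first term."""
    return set(PROPOSITIONS[number][1:])


def related(a, b):
    """True if two propositions share an argument or one embeds the other."""
    shares_argument = bool(arguments(a) & arguments(b))
    embeds = a in arguments(b) or b in arguments(a)
    return shares_argument or embeds


# Every related pair, listed once.
links = [(a, b) for a in PROPOSITIONS for b in PROPOSITIONS
         if a < b and related(a, b)]
print(links)
```

Under these assumptions Propositions 1 and 2 are linked through art, Propositions 1 and 4 through Greek, and Propositions 2 and 8 because the second embeds the first, whereas Propositions 2 and 3 have nothing in common.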

The hierarchical text structure in Figure 13.8 is supposed to give us an overall 
picture of the structure of our little story. It contains all of the propositions that 
make up the story, and it shows how all of these propositions are related to one 
another. High-level propositions, or important propositions, are at the top of the 
figure, whereas low-level, or less important, propositions are at the bottom. 

This kind of analysis of the structure of a story has led to a number of 
predictions, which have been experimentally confirmed: 

1. When people recall a paragraph like this, their recall is correlated with level 
of proposition. That is, more high-level propositions (like Proposition 1) are 
recalled than low-level propositions (like Proposition 8). What this is saying 
is that people remember the important facts and tend to forget the details. 
People are more likely to remember that Greeks loved art than they are to 
remember that the Romans copied them, and so on (Kintsch et al., 1975; 
Mandler & Johnson, 1977; Meyer, 1975; Thorndyke, 1977). 

2. When the number of words in a story is kept constant, the reading time for 
the text increases as the number of propositions in the story increases 
(Kintsch & Keene, 1973). In other words, it takes longer to read stories that 
are jam-packed with assertions. 
3. Using a large number of words or elements (Greek, Roman, art, beautiful,
and so on), as opposed to referring to the same few things over and over,
also leads to increased reading time and decreased recall (Kintsch et al.,
1975).

4. The ability of a word to act as a recall cue increases as the number of 
propositions in which it appears increases (Wanner, 1974). 

Thus, the idea of describing the structure of a story as a list of propositions and in 
terms of a hierarchy relating those propositions to one another seems to be a good 
start. This is not the end of the line, however. A number of investigators are 
building on and expanding this interpretation of prose structure, and the in¬ 
terested reader is referred to Kintsch and van Dijk (1978), Miller and Kintsch 
(1980), and Spilich, Vesonder, Chiesi, and Voss (1979). 

One interesting alternative approach has been adopted by Thorndyke (1977).
He also proposes that the general structure of a story is very important in terms of 
how well we can remember it. But the structure he developed contained the 
following elements: 

1. the setting, describing characters, location, and time

2. the theme, providing focus and goals

3. the plot, involving steps to achieve the goals 

4. the resolution, or final outcome 

The following story contains each of these four elements: 

[1] Circle Island is located in the middle of the Atlantic Ocean [2] north of 
Ronald Island. [3] The main occupations on the island are farming and 
ranching. [4] Circle Island has good soil but [5] few rivers and [6] hence a 
shortage of water. [7] The island is run democratically. [8] All issues are 
decided by a majority vote of the islanders. [9] The governing body is a senate 
[10] whose job is to carry out the will of the majority. [11] Recently, an island 
scientist discovered a cheap method [12] of converting salt water into fresh 
water. [13] As a result, the island farmers wanted [14] to build a canal across 
the island, [15] so that they could use water from the canal [16] to cultivate 
the island’s central region. [17] Therefore, the farmers formed a procanal 
association [18] and persuaded a few senators [19] to join. [20] The procanal 
association brought the construction idea to a vote. [21] All the islanders 
voted. [22] The majority voted in favor of construction. [23] The senate, 
however, decided that [24] the farmers’ proposed canal was ecologically 
unsound. [25] The senators agreed [26] to build a smaller canal [27] that was 
2 feet wide and 1 foot deep. [28] After starting construction on the smaller 
canal, [29] the islanders discovered that [30] no water would flow into it. [31] 
Thus the project was abandoned. [32] The farmers were angry [33] because 
of the failure of the canal project. [34] Civil war appeared inevitable. 
(Thorndyke, 1977, p. 80)

The statements have been numbered for our discussion and were not numbered
when the subjects read the story. The first 10 statements describe the setting, the
next 6 statements outline the goals, the plot is contained in statements 17-31, and
the resolution is contained in the last 3 statements.

Figure 13.9 Recall of the Circle Island passage for the different experimental conditions
(Story; Narrative, after theme; Narrative, no theme; Random).
(From Thorndyke, P. W. Cognitive structures in comprehension and memory of narrative
discourse. Cognitive Psychology, 1977, 9, 77-110. Copyright 1977 by Academic Press,
Inc. Reprinted by permission.)

To show how very important the arrangement of these four elements is in 
determining how well we recall the story, Thorndyke set up four conditions. In 
one condition subjects read the story just as you see it above. In a second 
condition the theme came at the end of the story. In a third condition the 
statement of the goal was omitted entirely. In the fourth and most extreme 
condition the sentences were presented randomly. After they read their version of 
the story, all subjects tried to recall the ideas in the story. 

Figure 13.9 contains the recall data. Clearly, recall was strongly influenced by 
the organization. Recall was best when setting, theme, plot, and resolution 
followed one another in an orderly fashion; random presentation of the sentences 
led to poor comprehension and recall. 

Knowledge Distorts Memory 

It is ironic but true that our general knowledge, or what we know about the 
world, will often introduce distortions into our attempts to remember prose 
material. What we are talking about here is a major context effect. When we are 
presented prose material, it is not presented in a vacuum; it occurs in the context 
of everything else that we have stored in our memory systems. And the content of 
those memory systems colors, or influences, our retention of the newly presented 
material. 

Bartlett’s (1932) early work represents a famous and prime example of the 
distorting effects of our previous knowledge on our attempts to remember new 
materials. Bartlett had subjects read an Indian folktale called The War of the 
Ghosts. He then had them attempt to reproduce the material after intervals of 
time ranging from very brief to up to 10 years. In some cases he had subjects try to
recall the same story repeatedly over time. What he found was that the subjects’ 
recall was characterized by deletion, simplification, extrapolation, and invention. 
The subjects took a lot out of the original prose and put a lot of new material into 
it. Bartlett and others have suggested that recall of the material is heavily affected 
by what the subject already knows, or has stored. The subjects left out parts that 
seemed strange or unfamiliar to them. The story was shortened. It became more 
coherent. Less-familiar elements, such as canoe and hunting seals, became the 
more familiar boat and fishing. Subjects added elements entirely absent in the 
original story, such as dark forests and totems. Although some investigators (see 
Cofer, 1973; Spiro, 1977; Zangwell, 1972) have failed to find the same degree of 
distortion reported by Bartlett, most investigators agree that distortion does occur 
and that it is related to what the subject already knows and has stored. 

Construction Versus Reconstruction 

One of the questions that has intrigued investigators in this general area has to do 
with when the distortions described by Bartlett occur. Do they occur at the time 
the material is originally being presented? Or do they occur when the subject is 
attempting to recall the material? Changes that occur during initial presentation 
of the material are usually called constructive changes, whereas changes occur¬ 
ring during recall are called reconstructive changes. In all likelihood both con¬ 
structive and reconstructive changes occur. It would be an oversimplification to 
assume that all the observed distortions are either constructive or reconstructive. 
The available evidence suggests that both occur (see Hanawalt & Demarest, 
1939; Keenan & Kintsch, 1974; McKoon & Keenan, 1974).

Here is an example of the way psychologists have established that some 
distortions occur at the time of recall. In an early study Carmichael, Hogan, and 
Walter (1932) presented materials such as those contained in Figure 13.10 to 
subjects. Before each picture was presented, the experimenter said, “The next 
figure resembles . . .” and then gave one of the two descriptions listed in Figure 
13.10. One group of subjects received one description, whereas another group 
received the other. Subjects then tried to reproduce the figure they were shown. As 
can be seen in Figure 13.10, the subjects’ drawings were strongly influenced by the 
verbal labels that had been provided. In a sense this is a context effect, in which 
the subjects’ knowledge influenced what they remembered. 

The results do indicate that distortion occurs and that it is influenced by a
subject’s knowledge, but they do not clearly establish when the distortion occurs.
It could occur constructively (when the material is presented) or reconstructively 
(when the drawing is recalled). So Hanawalt and Demarest (1939) did the 
experiment over again with the added condition that the verbal descriptions were 
not given during the learning phase of the experiment. They were given only at 
the time of recall. The researchers still found distortions occurring. Although 
this experiment does not completely clear up the issue, it does indicate that at 
least some reconstructive distortion occurs; distortion is not all of the construc¬ 
tive type. 


Figure 13.10 The effects of verbal labels on the retention of visually presented figures. 
Subjects received either one or the other of the alternative descriptions and then 
attempted to reproduce the figures in the middle column. Sample reproductions in the 
extreme left and right columns illustrate the impact of verbal labels on retention. 
(Adapted from Carmichael, L. C., Hogan, H. P., and Walter, A. A. An experimental 
study of the effects of language on the reproduction of visually perceived forms. Journal 
of Experimental Psychology, 1932, 15, 73-86. Figure 27, p. 80. Copyright 1932 by the 
American Psychological Association.) 


Schemata 

To account for these distortions and changes that people introduce into prose 
material that they are trying to remember, Bartlett introduced the idea of 
schemata (plural of schema). A schema, or a frame, as it is called by some modern 
investigators, refers to the organized body of information an individual has about 
some action, concept, event, or other segment of knowledge. For instance, a 
reader might have a schema that organizes everything that reader knows about 
war: it requires weapons, people are killed in battles, and so on. People can have 
schemata representing knowledge about real estate, psychology, bathing suits, 
and justice. In other words, any body of knowledge, organized around a central 
core concept, represents a schema. 

According to Bartlett, distortions in memory occur because people interpret the 
incoming story in terms of existing schemata. In a sense the schema is the context 
in which the story occurs, and the subject’s memory of that story is influenced not 
only by the story material itself but also by the schema. 

When people read The War of the Ghosts, they were likely to leave out details 
that sounded strange to them and to alter the story so that it conformed to stories 
they had already heard. They made the story conform to existing schemata. They 
were likely to add elements of war with which they were familiar and to delete 
elements of war that were unfamiliar to them, or did not occur in their existing 
war schema. Everything not in their existing war schema would be likely to be 
dropped out. Everything in their war schema not in the story would be likely to be 
added. The longer the delay between presentation and recall, the more likely the 
subject is to put in these details from existing schemata (see Sulin & Dooling, 
1974). 

Stories tend to be constructed differently in different cultures. Thus, we have a 
story schema that contains our ideas about what a story is. But Native Americans 
had a different story schema. Hence, when we try to recall an old Indian story, we 
distort it so that it is brought more into line with our ideas about what constitutes 
a story. This cultural aspect of story telling was demonstrated by Kintsch and van 
Dijk (1975). They gave subjects either stories from Boccaccio’s Decameron, 
whose structures are not unusual to American subjects, or Indian folktales, 
which are written with a very different and unfamiliar structure. These authors 
found that the subjects’ summaries of the Decameron stories were much more 
accurate than their summaries of the Indian tales. The lack of appropriate 
schemata for the Indian stories made their accurate retention difficult. 

Anderson and Pichert (1978) provide another interesting example of the importance
of schemata in prose memory. Subjects read a rather neutral description 
of a house from either of two different viewpoints; they pretended they were 
either a potential house buyer or a burglar. Then they attempted to recall the 
description of the house. Once they got to the point where they could recall 
nothing more, they were asked to recall whatever they could given the alternative 
viewpoint. The subjects who had first recalled from the home buyer’s viewpoint 
then tried to recall from a burglar’s viewpoint. Similarly, the burglar became a 
home buyer. Recall went up! When the subjects shifted perspective, they were 
able to recall more details that were unavailable when they were operating from 
the first perspective. The newly made burglar suddenly remembered that the door 
was hidden by tall bushes, while the newly made home buyer suddenly remembered
that the living room was large. Using a new schema (adopting a new 
viewpoint) allowed access to information that was not easily available under the 
original schema. 


A “Common Cents” Schema 

Figure 13.11 Modal and actual coins. (From Rubin, D. C., and Kontis, T. C. A schema 
for common cents. Memory and Cognition, 1983, 11, 335-341. Fig. 1, p. 336, and 
Fig. 2, p. 338.) 

Rubin and Kontis (1983) have presented some data that indicate that people have 
a very general schema for U.S. coins (penny, nickel, dime, quarter) that is different 
from each of the individual coins. They gave subjects some empty circles and 
asked them to fill in the details of the four U.S. coins. The subjects were told that 
the experimenters were more interested in their knowledge of the coins than in 
artistic ability and that they should fill in as much detail as possible. Once this was 
done, the experimenters determined which inscriptions, placements, and orientations
had been provided most often by the subjects. A pictorial representation of 
these results is contained in Figure 13.11. The actual U.S. coins are presented on 
the right side of the figure for comparison. But look at the coins on the left. These 
are the modal coins drawn up by the authors. There are two important things to 
notice. First, none of the modal coins based on subjects’ recall corresponds 
accurately with the actual coins. Second, all four modal coins have a great deal in 
common. All profiles face to the left. “In God we trust” is at the upper edge. The 
date is always in the lower right quadrant. The value of the coin is at the bottom. 
In other words, we seem to have a general conception, or schema, of U.S. coins 
that strongly affects what we will do when we try to reproduce the coins. Once 
again, distortions in memory occur because we interpret incoming information 
(the request to draw a coin) in terms of our “coin schema.” 
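
The tallying step behind a modal coin can be illustrated with a short sketch. It assumes each subject's drawing has already been coded into a few named features; the feature names and the four coded drawings are hypothetical illustrations, not data from Rubin and Kontis. For each feature, the value supplied most often becomes part of the modal coin.

```python
from collections import Counter

# Hypothetical coded drawings: one dictionary per subject.
# These feature names and values are invented for illustration only.
drawings = [
    {"profile": "left", "motto": "top", "date": "lower right"},
    {"profile": "left", "motto": "top", "date": "lower left"},
    {"profile": "right", "motto": "top", "date": "lower right"},
    {"profile": "left", "motto": "bottom", "date": "lower right"},
]


def modal_coin(coded_drawings):
    """Return, for each feature, the value drawn by the most subjects."""
    features = coded_drawings[0].keys()
    return {
        feature: Counter(d[feature] for d in coded_drawings).most_common(1)[0][0]
        for feature in features
    }


modal = modal_coin(drawings)
print(modal)
```

Note that the modal coin built this way need not match any single subject's drawing, nor any actual coin, which is the sense in which it reflects a general schema.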

The reader interested in pursuing schemata is referred to Bobrow and Norman 
(1975), Bower, Black, and Turner (1979), Dooling and Lachman (1971), Kozminsky
(1977), Minsky (1975), Rumelhart and Ortony (1977), and Schank and 
Abelson (1977). 


Courtroom Testimony 

Loftus and Palmer (1974) report another interesting example of how our understanding
of a situation can be affected by context. They showed subjects a film of 
a traffic accident and then asked them one of two questions: “How fast were the 
cars going when they bumped?” or “How fast were the cars going when they 
smashed?” (emphasis added). Then the subjects were dismissed with instructions 
to return in one week. At this second session it was found that the subjects who 
had been asked the smashed question during the first session were very likely (and 
incorrectly) to say that they had seen broken glass in the film. In other words, the 
context sentence influenced their understanding of the situation (see Bekerian & 
Bowers, 1983; Weinberg, Wadsworth, & Baron, 1983). 

Obviously, this kind of result has important implications for courtroom testimony.
It suggests that leading questions can strongly affect what an eyewitness 
will report having seen. False reports may occur, depending upon how questions 
are asked. The problem is not easily resolved. As Reed (1982) puts it: 

A judge can immediately rule leading questions out of order, but not before 
the members of the jury have heard the question. Instructions from the judge 
to disregard certain evidence may not prevent the jury from considering that 
evidence when making their decision. More subtle uses of language, such as 
use of the word crash rather than the word hit, may not even be identified as 
potentially misleading. (p. 250) 

APES AND LANGUAGE 

We have come a long way in our investigation of language. Starting with language 
development, we have worked our way through what is known about words, 
sentences, and stories. But there remains one intriguing area of investigation— 
that of language in apes. Clearly, when we study ape language we are not going to 
get to the point where we read stories to them and then investigate the schemata 
they use in trying to recall the material. But because our concern throughout this 
book has been the continuity of human and animal research, we do want to ask 
the most basic question: Can apes use language at all? 

Although the topic is controversial, with some arguing that apes do not develop 
true language (see Seidenberg & Petitto, 1979; Terrace, 1979; Ward, 1983), the 
accumulating evidence is fairly convincing: It appears that apes may be able to use 
at least the rudiments of language. They do not use it as ingeniously and as 
creatively as humans, but they do seem capable of using language on a level 
roughly equivalent to that displayed by humans in their earliest stages of language 
development. 

In early studies of language and apes (see Hayes & Hayes, 1952) an effort was 
made to teach chimpanzees to speak. Unfortunately, this approach was very 
frustrating because the animals could learn to produce only a very few words, 
such as cup and up, and these only with great difficulty. As it finally turned out, 
the problem was not so much that this was all the language the animals could 
learn but that they had trouble vocalizing. It was trying to speak that was the 
problem. (But see Box 13.1.) 

Box 13.1 Study Suggests That Monkeys Can “Talk” 

Anyone who has ever owned a pet has wondered if animals can “talk.” 

Monkeys can, according to Dr. Peter Marler, a zoologist. 

Marler bases his belief on seven years of experiments on monkey communities in 
East Africa. He has determined that the warning cry emitted by a monkey when 
attacked by a leopard differs from the cry given when an eagle attacks. 

“A monkey will never give an eagle alarm when, say, an elephant walks by,” he 
said. “It will give an elephant alarm cry.” 

Marler’s findings suggest that a primitive form of language exists in a 
non-human society. He believes that when the monkeys cry out, “they are 
expressing their thoughts, as well as their feelings.” This challenges widely held 
beliefs that animal noises merely express emotions such as fear and passion. 

Marler, of Rockefeller University, spoke recently at the American Society of 
Primatologists meeting at Michigan State University. 

“It seems as if, when an animal makes an eagle alarm call, it evokes the image of 
an eagle in the minds of the other monkeys,” Marler said. “When you play back a 
recording of an eagle alarm call, they look up into the sky.” 

If a leopard is encountered and an alarm sounded, he explained, “the monkeys 
escape to the tops of the trees. This is the right thing to do because the leopard 
hunts down below in the dense bush. But if the eagle is the predator, the monkeys 
flee into the dense bush.” 

Marler and two doctoral assistants began observing colonies of vervet monkeys 
near Nairobi, Kenya, in 1976. They made audio and video recordings and 
established that the monkeys had specific warning cries for different predators. 

“We had lots of experiences with animals in nature that didn’t jibe with the 
traditional view,” he said. “It seems they are using the alarm calls as if they were 
the names of animals. When we played the tapes back and the predators were not 
around, they evoked their escape mechanisms. 

“Clearly there is an objective relationship between the predator call and the 
predator.” 

Now Marler is studying rhesus monkeys. He has found that they also have 
sounds for specific dangers encountered within a monkey colony. 

From Hugh McCann, Los Angeles Times, November 30, 1983, Part 1A, p. 6. Copyright, 1985 The Detroit News. 

Gardner and Gardner (1971, 1975a, 1975b, 1978) hit on a new idea. If the 
chimpanzee could not speak, perhaps it could use sign language. Sure enough, 
their now famous chimpanzee, named Washoe, was able to learn and use about 
150 signs of the American Sign Language. She was able to master simple sentences,
such as “Hurry gimme toothbrush” and “You me go out hurry.” When she 
was late, she sometimes signed “hurry” to herself. She used to swear at other 
animals and once called a rhesus monkey a “dirty monkey” (Linden, 1975). 
Clearly, Washoe never did, and never will, acquire the ability to use language in 
the manner that adult humans do. But she was not entirely without language, 
either. She did display some creative language use. For example, once she had 
learned “more” to request more tickling, she would use it in other situations, such as 
“More food” or “More juice.” In fact, she actually invented a few signs, such as 
one for bib, that she had never seen before! Many chimps have now been taught 
sign language, and many of them seem to display a degree of inventiveness (see 
Fouts & Couch, 1976; Linden, 1975; Patterson, 1978). 

An alternative to the use of sign language has been developed by Premack 
(1970, 1976), who used pieces of colored plastic of different shapes to represent 
the components of language rather than hand movements. A chimp named Sarah 
would place these symbols on a board to correspond roughly with the word order 
in a sentence. She was able to master and use a set of plastic symbols well over 
100 in number. 

The investigation of language in apes is progressing rapidly. For example, 
Roger Fouts (Fouts, Hirsch, & Fouts, 1982) has reported some intriguing results. 
He has found that two chimps proficient in sign language will sign messages to 
each other. He also reports that not only will a young chimp imitate an older 
chimp’s use of sign language but also the older chimp will actively teach sign 
language to the younger animal. 

In summary, even though serious reservations exist (see Pate & Rumbaugh, 
1983) and we will have to await further research, it appears at present that 
chimps can use language on a level with the least experienced human being. Thus, 
the gap between the illustrious human and the lowly animal may be smaller than 
has long been believed. 


SUMMARY 

1. Language enables us to communicate and to stockpile information. It may 
facilitate thinking, as well. 

2. All human babies display spontaneous babbling early in life. This babbling 
appears to include all of the sounds necessary to speak any language, even 
though the child quickly begins to concentrate on the sounds contained in the 
language of his or her environment. 

3. True words and holophrastic speech tend to be picked up quickly in the first 
few years of life. Two-word sentences and telegraphic speech follow. 

4. Although children vary in terms of how quickly they acquire language, they 
all appear to acquire grammar in the same order. 

5. Early theories of language acquisition based on conditioning principles have 
been replaced by the psycholinguistic approach, which argues that language 
usage is genetically based and involves hypothesis testing as well as creative 
thinking. 

6. Spoken languages are composed of basic sound units called phonemes. These 
basic sounds are combined to form morphemes, the smallest meaningful unit 
in language. Morphemes are combined to form words. 

7. Early efforts to study word storage focused on word-association tasks. More 
recently, word storage has been thought of in terms of hierarchies, matrices, 
feature comparisons, and spreading activation. 

8. Phrase-structure grammar analyzes sentences in terms of phrases, or groups 
of words centered on nouns or verbs. 

9. Phrase-structure grammar can be depicted in terms of tree structures and 
rewrite rules. 

10. In click-migration studies, clicks superimposed over sentences are perceived 
as occurring at the end of a phrase even though they actually may have 
occurred in the middle of the phrase. This effect has been taken as evidence 
for the psychological reality of phrases. Transitional error analyses offer 
further support for phrase-structure grammar. 

11. Phrase grammars are limited, because they do not account for such things as 
ambiguous sentences and do not capture the fact that many different sentences
express the same idea. 

12. Surface structure refers to the wording of a sentence, whereas deep structure 
refers roughly to the idea behind the sentence. 

13. The meaning of a sentence tends to be remembered better than its exact 
wording. 

14. According to transformational grammar, different surface structures can be 
developed from the same deep structures through the application of various 
transformations. Several transformations can be performed in succession, 
leading to a wide variety of sentences. 

15. Sentences that are the result of greater numbers of transformations are 
sometimes, but not always, more difficult to comprehend and remember. 

16. HAM and ACT are computer-based models of how information is encoded, 
stored, and retrieved. HAM stores propositions, not sentences. 

17. Input trees describe the manner in which HAM stores propositions. Using 
the methods of incremental cuing, various tests of HAM can be made. These 
tests have sometimes, but not always, been successful. 

18. The meaning we derive from a sentence is affected by the surrounding 
sentences, or context. Context sentences affect not only the difficulty of 
understanding a given sentence but the actual meaning of the target sentence 
as well. 

19. Semantic integration refers to the fact that we combine, or integrate, separate 
sentences so that our understanding is affected by both sentences. 

20. Context effects, both verbal and visual, can occur in longer pieces of prose as 
well as in the case of two or three sentences. 

21. The structure of stories can be described in terms of a list of propositions and 
a hierarchical text structure relating those propositions to one another. 
Stories can also be thought of in terms of setting, theme, plot, and resolution. 

22. Bartlett’s early work indicates that what we know about the world can affect 
our efforts to remember any given piece of material. We tend to delete, 
simplify, extrapolate, and invent when we try to recall prose material. 

23. Some errors are constructive, in that they occur at the time the material is 
originally presented. Other errors, occurring at the time of recall, are called 
reconstructive errors. 

24. The organized body of knowledge an individual has about a given topic, 
which influences and distorts recall of prose material, is called a schema, or 
frame. 

25. Everything in the schema is likely to be added to the story, and everything in 
the story but not the schema is likely to be dropped. Cultural differences in 
schemata exist, and they affect recall. 

26. Using a new schema (adopting a new viewpoint) can improve recall. 

27. Context effects may have important implications for courtroom testimony. 

28. Apes appear to be able to use the rudiments of language. Although they have 
great difficulty in speaking, they can use sign language and plastic symbols in 
place of speech. When they do so, they appear to be able to use language 
about as well as a human who is just beginning to develop language use. 

Accessibility. See Item accessibility 
Acetylcholine (ACh), 182 
Acoustic coding, 274-276 
Acquired equivalence of cues, 196 
Acquisition, 109 
Activity drive, 177 
Acts, defined, 133 
ACT theory, 268, 371 
Adaptation, 33 
Age regression, 132-133 
Agoraphobia, 51 
Alcoholism 

aversion therapy and, 45, 92 
stimulus generalization and, 76 
Algebraic summation theory, 199-201 
All-or-none learning, 122-123 
versus gradual learning, 129-137 
Guthrie’s and Estes’ approaches, 
133-135 

Alpha response, 30 
Ambiguous sentences, 367-368 
Amnesia 

retrograde, 236-237 
reversible, 237-238 
Amount-of-reward variable, 169 
Animal memory, 288—293 
DMTS and animal STS, 289-290 
interference and, 288-289 
serial-pattern learning, 292-293 
spatial memory, 290-292 
stimulus selection and, 309-310 
Anorexia nervosa, 39 
Assertion training, 48-50 
Associative learning, 207 
Atkinson-Shiffrin buffer model, 246—248 
Attention, stimuli and, 115 
Attributes 

attribute learning, 332-333 
defined, 331-332 

Auditory sensory memory, 248, 249 
Autism, 36 

Automatic processing, 145-146, 293-294 
Autoshaping, 84, 162-163 
Availability. See Item availability 
Averaging data, 125-129 
Aversion therapy, 45-46 
bait shyness and, 91-92 
Aversive conditioning, 24 
Avoidance training, 39-43, 89 
escape training and, 44 
flooding, 40 

learned helplessness, 41-43 


punishment training and, 46 
two-factor theory of, 40-41, 92 

B 

Babbling, 361 

Backward classical conditioning, 27 
Bait-shyness effect, 32, 88—92 
aversion therapy and, 91—92 
avoidance conditioning and, 89 
control of predation and, 91 
CS-UCR interval and, 120 
preparedness and, 89-90 
Behavior 
fixated, 206 

Hull’s theory of, 129-133 
schedule-induced, 161-162 
Behavioral contrast effect, 203-204 
Behavioral medicine, 65-66 
Behaviorism, 129 
concept formation and, 335 
Behaviorist language, 114 
Behavior modification, 47—51 
generalization and, 197 
Biofeedback training, 64—66 
application of techniques, 65 
behavioral medicine, 65-66 
unprepared human learning and, 87 
Blank trial method, 338—340 
Blocking effect, 35, 120-121 
Blood pressure 

biofeedback and, 64 
classical conditioning and, 35 
Boredom drive, 177 

C 

Catecholamine, 183 
CCCs, 20 

Chained schedules, 160-161 
Choice test, 172 
Cholinesterase, 182 
Chunking. See Memory chunk 
Classical conditioning, 7-10 
aversive, 24 

compared with instrumental 
conditioning, 57—67 
in everyday life, 9—10 
eye blink, 24 

interference effects and, 228 
language acquisition and, 362 
masochism, 25-26 
phobias, 26 

practical uses of, 35—36 
secondary reinforcers and, 173 
semantic, 25 
stages of, 9 

temporal spacing of CS and UCS, 26-27 
Clustering in recall, 305—308 
exhaustive versus nonexhaustive 
categories, 306-307 
subjective organization (SO), 307—308 
Cognition, approaches to, 325—327 
Cognitive behavior modification (CBM), 
47-51 

Cognitive mediation, 62—63 
Cognitive processes, 325 
Competition 

forgetting and, 228-229 
specific versus generalized, 231-232 
versus unlearning, 229—230 
Compounding, 34 
Compound stimulus effects, 33—35 
Computers, problem solving and, 352 
Concepts 
animals and, 345 
defined, 17 

discovering attributes and rules, 

331-334 

feature-frequency and, 344—345 
formation of, 17—18, 330—331 
hypothesis-testing interpretation of 
concept formation, 335—340 
natural categories and, 340—343 
prototypes and, 343—344 
S-R interpretation of concept formation, 
334-335 

Concurrent schedules, 161 
Conditioned excitement, 58 
“Conditioned illness,” 89 
Conditioned inhibition, 34 
Conditioned response (CR), 7—8, 24 
test trials and inhibition of delay, 28-29 
UCR and, 29 

Conditioned stimulus (CS), 7, 20 
CS-UCS interval, 31—32 
intensity of, 32 
serial compound, 32 
Conditioned suppression, 34 
Conditioning 
classical, 7-10 

compound stimulus effects, 33—35 
defined, 6—7 

examples of classical, 24-26 
instrumental (operant), 10—11 
Conservative focusing, 338 
Consolidation hypothesis, 236—238, 288 
Consummatory theory, 179 
Context effects, 294 
courtroom testimony and, 386 


Contextual interpretation, 138 
Contiguity, 114 

CS “blocking” and, 120-121 
CS-UCR interval and, 119—120 
informational theory and, 121 
paired-associate learning and, 118-119 
short-term memory and, 121—122 
subjective organization and, 117—118 
Continuity theory, 199-201, 203 
Continuous reinforcement (CRF), 155 
versus partial reinforcement, 159 
Continuum of preparedness, 83 
imprinting and, 104—105 
Controlled processing, 145—146, 293—294 
Control processes, 253-256 
cultural differences in, 256 
Corpus callosum, 295, 296 
Counterconditioning, 73 
CS preexposure effect, 33 
CS-UCS interval, 31-32 
Cue selection, 308 

Cumulative response curves, 127-129 
CVCs, 20 

D 

Decay theory, 235—236, 252 
STS and, 273 

Delayed classical conditioning, 27 
Delay of reinforcement, 154, 165—168 
Delay-of-reinforcement gradients, 165 
Delayed-matching-to-sample procedure 
(DMTS), 289-290 

Depression, learned helplessness and, 42 
Differential reinforcement, 198 
Directed forgetting, 256 
Discrimination, defined, 187 
Discrimination hypothesis, 163—164 
Discrimination learning, 11—14, 77 
algebraic summation interpretation of, 
199-201 

behavioral contrast and, 203-204 
concept formation and, 334-335 
errorless, 204—205 
in humans, 207-209 
insoluble discrimination problems, 
205-207 

noncontinuity theory, 202—203 
preparedness in, 84—85 
relational theory of, 201 
response differentiation, 209—210 
simultaneous versus successive methods 
of investigating, 199 
transposition, 201-202 
Distinctiveness hypothesis, 144 
evidence for, 258-259 
Distractor technique, 272, 289 
Distributed practice (DP), 137-140 
Disuse theory. See Decay theory 
Drive-reduction theory, 177 
Dual-trace hypothesis, 315, 316 
Echoic memory, 250-251 
Elaboration hypothesis, 144 
evidence for, 258—259 
Elaborative rehearsal, 144—145, 146 
Electrical brain stimulation, 132, 180-181 
Encoding, 242, 243 
dual-trace theory of, 315-316 
Encoding specificity principle, 259 
Encoding variability, 138-140 
Endorphins, 43, 58, 181 
Enuresis, 35 
Episodic memory 

flashbulb memories, 286-287 
mixed memories, 287 
versus semantic memory, 284-287, 
302-303 

tip-of-the-tongue phenomenon, 

285-286 

Errorless discrimination learning, 204-205 
Escape training, 44 
Ethology, 94—97 

imprinted behavior and, 100—101 
Eyebrow-flick response, 85—86 
Expectancy approach, 35 
Experimental neurosis, 206 
Exploratory drive, 177 
External inhibition, 75 
Extinction, 40, 68-73 
as acquisition, 70 
generalization of, 196—197 
partial reinforcement and, 163-164 
systematic desensitization, 71—73 
of tantrums, 70-71 
unlearning as, 230—231 
Extinction test, 172 
Extinction trial, 28 

Eye blink conditioning, 24, 32, 57, 59 

F 

“Family resemblances,” 342—343 
Fear. See also Phobias 
in classical and instrumental 
conditioning, 58 

classical conditioning and, 35—36 
imprinting and, 102 
two-factor theory and, 40—41 
Feature models, 264-266, 364 
Features, defining versus characteristic, 

264 

Fixated behavior, 206 
Fixed action pattern, 95, 101 


Fixed-interval (FI) schedule, 155-156 
Fixed-ratio (FR) schedule, 157-158 
Flooding, 40 
Forgetting 
directed, 256 
extinction and, 70 
proactive inhibition and, 224 
two-factor theory of, 228-232 
Free recall, 16—17, 137-138 
Frequency units, 208 
Frequency theory, 208-209 
Frustration, behavioral contrast effect and, 
204, 205 

Frustration theory, 164 
Functional fixedness, 348, 352 

G 

Galvanic skin response (GSR), 77 
Generalization, 187, 190-197 
behavior modification and, 197 
of extinction, 196-197 
primary and secondary, 195-196 
role of experience in, 192-195 
transfer of training and, 215-216 
variables affecting, 191-192 
Gestalt psychology, 301 
problem-solving and, 346-348 
Given-New principle, 375 
Global attributions, 42 
“Go” center, 180 
Gradual learning, 122—123 
versus all-or-none learning, 129-137 

H 

Habituation, 33 
Heart rate 

breathing patterns and, 63 
reinforcing increases and decreases in, 
61-62 

Hierarchies, 260-264, 364 
Higher-order conditioning, 34 
Holophrastic speech, 361-362 
Hope, as a conditioned response, 58 
Human associative memory (HAM), 
371-374 

incremental cuing and, 372—374 
input trees and, 372 
propositions and, 371—372 
Hypnotic age regression, 132—133 

I 

Iconic memory, 249—250 
Images. See Mental images 
Imprinting, 97—105 
experiments, 97—99 
humans and, 105 
as innate behavior, 100-103 
interaction models, 103-104 
natural versus laboratory, 99-100 
nonvisual, 100 

preparedness continuum and, 104—105 
Incentive motivation, 58, 131 
Incidental learning, 141 
Incremental cuing, 372—374 
Independent retrieval phenomenon, 233 
Informational theory, 35 
S-R contiguity and, 121 
Information processing tradition, 238 
Inhibition, 131 
Inhibition of delay, 29 
Innate behavior, 94-97 
hydraulic model of, 96—97 
imprinted behavior as, 100—103 
Innate releasing mechanism, 95 
Innate responses, 94 
Input trees, 372 
Instinctive drift, 93-94 
Instrumental conditioning, 10—11 
autoshaping and, 162—163 
compared with classical conditioning, 
57-67 

language acquisition and, 362 
preparedness in, 83—84 
Instrumental reward training, 36—39 
Interactive images, 316—318 
“Interchangeable-parts” conception of 
learning, 82—83 

Interference, 224—235, 245, 252 
animal memory and, 288—289 
generality of, 228 
imagery and, 312-314 
proactive inhibition, 224-226 
retroactive inhibition, 226-227 
in STS, 272-274 
Involuntary responses, 60-62 
Item accessibility, versus item availability, 
234-235 

Item availability, versus item accessibility, 
234-235 

Item displacement, 246-247 

J 

Journal of Applied Behavior Analysis, 38 

K 

Kamin’s blocking effect, 34 
Keyword method, 322 

L 

Lag effect, 138 
Language 

apes and, 386-388 


development of, 360-362 
human associative memory, 371—374 
importance of, 360 
prose, 374-386 
sentences, 364—371 
theories of language acquisition, 
362-363 
words, 363—364 
Language learning, 56 
Lashley jumping stand, 13, 205 
Latent inhibition, 33 
Learned helplessness, 41-43 
Learning. See also Discrimination learning 
biological constraints on, 90 
defined, 4—6, 7 
examples of, 6—21 
Guthrie’s and Estes’ approaches, 
133-135 

Hull’s approach, 129-133 
importance of learning studies, 21-22 
multiple-trace versus strength models, 
136-137 

psychology and, 20 
role of rehearsal in, 122—123 
verbal-discrimination (VD), 207-209 
Learning curves, 123—129 
Learning tasks, 54—78 

comparison of classical and instrumental 
conditioning, 57—67 
language learning, 56 
paired-associate learning, 54—56 
priming, 56—57 

Levels of processing, 140-144, 243-244, 
256-260 

criticism of, 142-143 
elaboration and distinctiveness 

hypotheses, 143-144, 258-259 
encoding specificity and, 259-260 
Lexical memory, 17, 260 

feature models and, 264—266 
hierarchies and, 260—264 
matrices and, 264 
semantic networks and, 364 
Lexical priming, 17 
Lie detector, 37 

Long-term store (LTS), 245, 246, 

252-253 

control processes and structural 
properties and, 253—256 
versus short-term store, 272, 278-280 

M 

“Mail-checking” behavior, 156 
Maintenance rehearsal, 144-145 
Manipulation drive, 177 
Masochism, 25-26 
Massed practice (MP), 137—140 
Matching law, 161 
Matrices, 264, 364 
Maximizing hypothesis, 161 
Mediation, 62-63 
“mental” mediators, 120 
Meditation, involuntary systems and, 64 
Memory 

animal versus human, 288—293 
automatic versus controlled processing, 
293-294 

components of, 242—243 
contextual clues and state-dependent 
memory, 294—295 
control processes and structural 
properties of, 253-256 
electrical brain stimulation and, 132 
episodic versus semantic, 284-287 
hypnosis and, 132—133 
information processing approach to, 
243-244 

mental images and, 310—315 
for pictures, 316—317 
primary and secondary, 244—246 
for prose, 381—384 
recall versus recognition, 280—284 
separate store issue, 272—280 
split-brain experiments, 295—296 
structural approach to, 300—301 
trace conditioning and, 120 
Memory chunk, 303—305 
problem-solving and, 354 
Memory, improving, 300—310 
attentional strategies, 308—310 
chunking and rewriting, 303-305 
mnemonics, 318—322 
recall by category, 305-308 
word associations, 301—303 
Memory models, 224—238, 242-268 
consolidation theory, 236—238 
decay theory, 235-236 
interference theory, 224—235 
levels of processing approach, 256—260 
semantic-network models, 260—268 
separate-store models, 243—256 
Mental images, 310—315 
interactive, 317—318 
interference and, 312—314 
mental rotation, 311—312 
perceptual-experience interpretation of, 
314 

propositional interpretation of, 314 
scanning experiments, 310—311 
Mental rotation studies, 311-312 
Method of loci, 320—322 
Mixed schedule, 159—160 


MMFR (modified, modified free-recall) 
test, 230 

Mnemonics, 318-322 
inventing stories, 320 
keyword method, 322 
the method of loci, 320-322 
rhyming, 319—320 
Monotonic group, 292 
Morphemes, 363—364 
Motivation, 5 
learning and, 123 
Movements, defined, 133 
Multiple-response measures, 32—33 
Multiple sensory channels, 100 
Multiple trace model, 136—137 
Muscular mediation, 62—63 

N 

Negatively accelerated performance curve,
68, 123, 127

Negative recency, 277-278 
Negative reinforcement, 46—47 
Negative stimulus, 11 
Negative transfer, 214 

versus proactive inhibition, 226 
Neural systems, 181 
Neurochemical changes, during 
inescapable shock, 43 
Noncontinuity theory, 202—203 
Noninteractive images, 317 
Nonmonotonic group, 292 
Nonsense syllables, 20—21 
Nonspecific transfer, 211—213 

O

Operant conditioning, 10-11 
Opioids, learned helplessness and, 43
Optimal-arousal-level theory, 178 
Orthographical distinctiveness, 259 
Osgood surface, 213-215, 227 
Output-rehearsal hypothesis, 118 
Overlearning reversal effect, 203 
Overshadowing, 34 

P 

Pain, suppression of, 43 
Paired-associate (PA) learning, 16, 20, 
54-56 

contiguity and, 118—119 
Paired-associate list, 16, 17 
Partial reinforcement, 159 
extinction and, 163—164 
generalization and, 192 
Partial-reinforcement effect (PRE),
163-164
Pavlovian conditioning, 7, 25-26
Performance, defined, 5 
Performance curves, 67-68, 123-129 
Performance measures, 123—125 
Perseverative responding, 206 
Personal attributions, 42 
Phobias, 26 

systematic desensitization and, 71-73 
Phonemes, 363—364 
Phonemic (acoustical) processing, 140 
Phrases, defined, 365 
Phrase-structure grammar, 365—368 
click migration and, 366 
problems with, 367—368 
transitional error probabilities and, 
366-367 
Polygraph, 37 

Positive reinforcement, 46-47 
Positive stimulus, 11 
Postreinforcement pause, 157 
Practice. See Rehearsal 
Praise, 175-176 

Precategorical acoustic storage (PAS), 251 
Predation control, 91 
Preexposure effects, 90 
Preferred-level theory, 178—179 
Prehatching vocal interactions, 100 
Premack principle, 179 
Prepared instrumental conditioning, 84 
Preparedness, 83—87 
bait shyness and, 89—90 
continuum of, 83 
in discrimination learning, 84—85 
in humans, 85—87 
in instrumental conditioning, 83—84 
Primacy effect, 276 
Primary generalization, 195-196 
Primary memory (PM), 244—245, 246, 
273 

Priming, 56—57 

level of processing and, 141—142 
spreading-activation model and, 
266-267 

Proactive facilitation, 225—226 
Proactive inhibition (PI), 224-226 
STS and, 273 
Problems, defined, 19 
Problem solving, 19-20, 330, 345 
algorithms and, 349 
animals and, 356 
computer simulation of, 352 
Gestalt interpretations of, 346—348 
heuristics and, 349-350
knowledge and, 353—355 
planning and, 355 
protocol analysis and, 351-352 
step analysis of, 348-349 


transfer in, 352—353 
working backward and, 350-351 
Processing 

controlled versus automatic, 145—146 
levels of, 140—144 
Procrastination effect, 157 
Propositions, 371—372, 378 
Prose, 374-386 

constructive versus reconstructive
changes in recall of, 382
context sentences, 374—376 
distortions in memory of, 381—382 
schemata and recall of, 382—384 
the structure of stories, 377—381 
verbal context and, 376 
visual context, 377 
Protocol analysis, 351—352 
Pseudoconditioning, 30—31, 33 
Psychosomatic illness, 63 
Punishment training, 44-46 

R 

Raynaud’s disease, 65 
Reaction-specific energy, 95 
Recall, versus recognition, 280—284 
control processes and, 283—284 
generation-recognition, 281—283 
recognition failure, 284 
unitary strength, 281 
Recency effect, 276—277 
Recognition, 280. See also Recall 
Recognition-failure phenomenon, 284, 285 
Rehearsal, 122—123 
levels of processing, 140—144 
massed versus distributed practice, 
137-140 

multiple-trace versus strength models, 
136-137 

Type I and Type II, 144-145, 256 
Rehearsal buffer, 246 
Rehearsal response, 208 
Reinforced practice, 5 
Reinforcement, 5—6, 154-183 
amount of, 168-172 
definitions and parameters of, 154-155 
delay of, 165—168 
drive-reduction interpretation of, 
130-131 

physical basis of, 180-183 
positive versus negative, 46—47 
schedules of, 155—165 
theories of, 176-180 
Reinforcement contrast effect, 170-171 
Reinforcement schedules, 154-165 
autoshaping, 162-163 
chained schedules, 160-161
concurrent schedules and the matching
law, 161

fixed-interval, 155-156 
fixed-ratio, 157—158 
of humans and animals, 164-165 
mixed schedules, 159-160 
partial-reinforcement effect, 163—164 
schedule-induced behavior, 161—162 
tandem, 160—161 
variable-interval, 156—157 
variable-ratio, 158—159 
Reinforcers, defined, 154 
Relational theory, 201 
Relaxation responses, 71 
Release-from-PI effect, 273—274 
Rescorla-Wagner model, 35 
Respondent conditioning, 7 
Response change neural system, 181 
Response competition, 229 
Response-deprivation hypothesis, 179 
Response differentiation, 209 
Response generalization, 216 
Response hold neural system, 181 
Response interchangeability, 82 
Response learning, 207 
Response probability, 131 
Responses, 116-117 

Response-set interference, 231—232, 234 
Response theory, 179 
Retention, 110 
Retrieval, 242, 243 
Retroactive facilitation, 226 
Retroactive inhibition (RI), 226—227 
STS and, 273 

Retrograde amnesia effect, 236—237 
Rewards, 5—6, 36-39 
Reward training, 36-39 
Rewriting, as a memory aid, 304 
Rhesus monkeys, reward training and, 36 
Rules 

defined, 332 

rule complexity, 334 

rule learning, 333 

S 

Scanning experiments, 310-311
Schedule-induced behavior, 161-162
Schemata, 382-386 
Secondary generalization, 195—196 
Secondary memory (SM), 244—245, 246 
Secondary reinforcers, 165—168, 172—176 
as carriers of information, 173-175
motivating properties of, 175 
as undetected primary reinforcers, 175 
Second-order conditioning, 34 
Self-directed intervention, 50-51
Self-reference, 142 


Semantic coding, 274-276 
Semantic conditioning, 25 
secondary generalization and, 195-196 
stimulus generalization and, 177 
Semantic integration, 376 
Semantic memory, 284-287, 302-303 
Semantic-network approach, 244, 364 
Semantic-network models, 260—268 
common characteristics of, 260 
feature models, 264—266 
hierarchies, 260—264 
matrices, 264 

spreading-activation models, 266-268 
Semantic processing, 140 
Sensory memory, 248—251 
echoic memory, 250—251 
iconic memory, 249-250 
Sensory preconditioning, 34 
Sensory register, 246, 248-251 
Sentences, 364-371 
phrase-structure grammar, 365-368 
surface and deep structure, 368-369
transformational grammar, 369-371 
verbatim versus meaning memory, 369 
Sentence-verification technique, 371 
Separate store models, 242, 243—248 
Atkinson-Shiffrin buffer model, 
246-248 

primary and secondary memory, 
244-246 

summary model of, 248—256 
Sequential theory, 164 
Serial anticipation learning, 15 
Serial compound (CS), 32 
Serial compounding, 34-35 
Serial (sequential) learning, 14-15 
Serial pattern design, 15 
Serial-position effects, 276—278 
Serotonin, 183 
Shadowing, 75 
Shaping, 36-37 

instrumental conditioning of language 
and, 362 

Short-term memory, 121—122 
Short-term store (STS), 245, 246, 
251-252 

control processes and structural 
properties and, 253—256 
interference effects in, 272—274 
versus long-term store, 272, 278—280 
negative recency, 277—278 
primacy effect, 276 
recency effect, 276—277 
Sign stimuli, 95, 101 
Simultaneous classical conditioning, 27 
Skinner box, 10, l /, 36 
Spatial memory, 290

Species-specific defense reactions (SSDRs), 
92-93 

Specific attributions, 42 
Specific response competition, 231 
Specific transfer, 211 
negative, 212—213 
similarity effects, 213—215 
Spontaneous recovery, 73—74, 231 
Spreading-activation theory, 266—268, 
364 

Stable attributions, 42 
State-dependent memory, 294 
Stimulus, 114—116 
Stimulus-change theory, 178 
Stimulus control, 198 
Stimulus dimension, 193, 194 
Stimulus encoding variability, 233-234 
Stimulus generalization, 75—77, 216 
Stimulus generalization gradient, 76, 190 
Stimulus intensity dynamism, 131 
Stimulus interchangeability, 82 
Stimulus learning, 207 
Stimulus-quality theory, 179 
Stimulus reversal procedure, 203 
Stimulus-sampling model, 133—135 
Stimulus selection, 308 
Stimulus-suffix effect, 250 
Stimulus theories, 178—179 
Stimulus trace, 115-116
Stop thought, 46 
Storage, 242, 243 
Strength model, 136—137 
Structural processing, 140 
Subjective organization (SO), 117—118, 
307-308 

Summation, 77—78 
Supernormal stimuli, 95—96 
Syllables, words, and prose, 20-21 
Synaptic transmission, 181—183 
Systematic desensitization, 71—73 

T 

Tachistoscope, 249 
Tactile sensory memory, 248, 249 
Tandem schedules, 160—161 
Taste-aversion effect, 88—92 
novelty and, 90 
Taste-aversion learning, 24 
Telegraphic speech, 362 
Temporal classical conditioning, 27 
Temporal contiguity of stimulus and 
response, 59-60. See also 
Contiguity 
Thumb sucking, 38 

Tip-of-the-tongue phenomenon, 285-286 
Token economies, 38 


Tokens, 175—176 
Total time law, 139 
Trace classical conditioning, 27, 89 
Trace conditioning, 120 
Transfer, 110, 187 

in problem solving, 352—353 
RI and PI and, 227 

Transfer and retroaction surface, 227—228 
Transfer of training, 187, 210—216 
generalization and, 215—216 
nonspecific transfer, 211—212 
specific transfer, 211 
Transformational grammar, 369—371 
Transitional error probabilities, 366—367 
Transposition effect, 201—202 
Trials to criterion, 124 
Trichotillomania, 45-46 
Type I rehearsal, 144-145, 256 
Type II rehearsal, 144-145, 256 

U 

UCS preexposure effect, 33 
Unconditioned response (UCR), 7, 24 
CR and, 29 

Unconditioned stimulus (UCS), 7, 24 
intensity of, 32 
Universal attributions, 42 
Unlearning, 229 
versus competition, 229—230 
encoding variability and, 233—234 
as extinction, 230—231 
independent retrieval and, 232—233 
Unprepared instrumental conditioning,
83-84

Unstable attributions, 42 

V 

Vacuum reaction, 95 
Variable-interval (VI) schedule, 156—157 
Variable-ratio (VR) schedule, 158—159 
Verbal coding, 315—316 
Verbal conditioning, 209 
Verbal-discrimination learning, 207—209 
Visual coding, 315—316 
Visual sensory memory, 248, 249 
Voluntary responses, 30, 60 

W

Warm-up, 212 

Washoe, 387-388 

“Water jug” problem, 352-353 

Word associations, 301-303

Words, phonemes and morphemes and,
363-364
